Loki said:amherst,
And since the result of this process doesn't produce a simple graph that directly maps 'high hit rates' to 'high standardness' and 'low hit rates' to 'low standardness', then the decisions on what constitutes 'standardness' tells us what?
The paper lists all 40 experiments, their rated standardness, and their hit rates. Experiments rated at high-standardness had high hit rates. Experiments rated at high non-standardness had low hit rates.
amherst