You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We may want to encourage experimenters to repeat trials on instances depending on the observed success, see also #1117 and #1978.
If some instances are repeated much more frequently than others, the significance test will mainly test the difference on these instances. This seems to be rather undesirable.
We can not easily stratify the significance test, because data sets do not need to be run on the same instances. We could however take the median of each instance to conduct the test, or another representative subsample. A subsample choice that also works if experiments have been done on a single instance seems preferable (even if such experiments should not be considered a valid data set).
The text was updated successfully, but these errors were encountered:
A somewhat related issue: when within-trial independent restarts are superseded by experimental repetitions, the $p$-values as they currently calculated will often consider only a comparatively small budget and only reflect relatively easy targets/problems. Should also $p$-values be calculated with simulated restarts?
We may want to encourage experimenters to repeat trials on instances depending on the observed success, see also #1117 and #1978.
If some instances are repeated much more frequently than others, the significance test will mainly test the difference on these instances. This seems to be rather undesirable.
We can not easily stratify the significance test, because data sets do not need to be run on the same instances. We could however take the median of each instance to conduct the test, or another representative subsample. A subsample choice that also works if experiments have been done on a single instance seems preferable (even if such experiments should not be considered a valid data set).
The text was updated successfully, but these errors were encountered: