-
Notifications
You must be signed in to change notification settings - Fork 130
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add a random_benchmark() method #240
Comments
ChrisCummins
added a commit
to ChrisCummins/CompilerGym
that referenced
this issue
May 4, 2021
The v0.1.8 release removed the random benchmark selection from CompilerGym environments when no benchmark was specified. If the user wishes for random benchmark selection, they were required to roll their own implementation. Randomly sampling from env.dataset.benchmark_uris() is not always easy as the generator may be infinite. For some datasets, e.g. Csmith, it is trivial to select random benchmarks by generating random numbers within the range of numeric seed values, but this is not obvious and the user shouldn't have to figure this out for the simple case of uniform random selection. This adds a `random_benchmark()` method to the `Dataset` class which allows uniform random benchmark selection, and a `random_benchmark()` method to the `Datasets` class for sampling across datasets. Issue facebookresearch#240.
ChrisCummins
added a commit
to ChrisCummins/CompilerGym
that referenced
this issue
May 4, 2021
The v0.1.8 release removed the random benchmark selection from CompilerGym environments when no benchmark was specified. If the user wishes for random benchmark selection, they were required to roll their own implementation. Randomly sampling from env.dataset.benchmark_uris() is not always easy as the generator may be infinite. For some datasets, e.g. Csmith, it is trivial to select random benchmarks by generating random numbers within the range of numeric seed values, but this is not obvious and the user shouldn't have to figure this out for the simple case of uniform random selection. This adds a `random_benchmark()` method to the `Dataset` class which allows uniform random benchmark selection, and a `random_benchmark()` method to the `Datasets` class for sampling across datasets. Issue facebookresearch#240.
bwasti
pushed a commit
to bwasti/CompilerGym
that referenced
this issue
Aug 3, 2021
The v0.1.8 release removed the random benchmark selection from CompilerGym environments when no benchmark was specified. If the user wishes for random benchmark selection, they were required to roll their own implementation. Randomly sampling from env.dataset.benchmark_uris() is not always easy as the generator may be infinite. For some datasets, e.g. Csmith, it is trivial to select random benchmarks by generating random numbers within the range of numeric seed values, but this is not obvious and the user shouldn't have to figure this out for the simple case of uniform random selection. This adds a `random_benchmark()` method to the `Dataset` class which allows uniform random benchmark selection, and a `random_benchmark()` method to the `Datasets` class for sampling across datasets. Issue facebookresearch#240.
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
🚀 Feature
The v0.1.8 release removed the random benchmark selection from CompilerGym environments when no benchmark was specified. If the user wishes for random benchmark selection, they must now roll their own implementation. For users who want to select benchmarks randomly, we should provide a simple
Dataset.random_benchmark()
option.Motivation
Randomly sampling from
env.dataset.benchmark_uris()
is not always easy as the generator may be infinite. For some datasets, e.g. Csmith, it is trivial to select random benchmarks by generating random numbers within the range of numeric seed values, but this is not obvious and the user shouldn't have to figure this out for the simple case of uniform random selection.Pitch
Extend the dataset classes with a
random_benchmark()
method:This method can be implemented by subclasses to efficiently select a benchmark using the provided RNG.
Alternatives
We don't provide any randomness methods. We require that users first enumerate a finite set of benchmark URIs and then sample it. This has the advantage of making the users think explicitly about the random distributions they wish to use. The downside is that it is more complex to roll your own random selection, and most users probably just want a uniform selection anyway.
The text was updated successfully, but these errors were encountered: