Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Exclusion list for datasets #306

Open
wiederm opened this issue Oct 31, 2024 · 0 comments
Open

Exclusion list for datasets #306

wiederm opened this issue Oct 31, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@wiederm
Copy link
Member

wiederm commented Oct 31, 2024

Feature Request: Exclude Data Points via exclusion Tag in dataset.toml

It would be helpful to have a way to exclude specific data points from the dataset by specifying them in an exclusion tag within the dataset.toml file. This tag would define a list of entries to skip when loading the dataset.

Proposed Implementation:
To avoid affecting the dataset split, this exclusion process could be implemented immediately after the dataset split step. This approach would allow the split to remain unaffected by the excluded entries while still filtering out unwanted data points. In that way the exclusion happens during the dataset setup phase and would have no impact on the dataset caching, which would be benefictial.

@wiederm wiederm added the enhancement New feature or request label Oct 31, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant