Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Refactor dataset so that when we upsample we get different masks for different queries of the same data. #126

Open
jstjohn opened this issue Aug 28, 2024 · 1 comment

Comments

@jstjohn
Copy link
Collaborator

jstjohn commented Aug 28, 2024

No description provided.

@jstjohn
Copy link
Collaborator Author

jstjohn commented Aug 28, 2024

Also note that the ESM2 case also samples items from clusters. This is another great reason to have special behavior handled in the dataset to control how to handle with querying beyond the number of items in self (clusters in the case of ESM2 or cells in the case of geneformer).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant