Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Batch integration] Isolated labels broken for pancreas data #757

Closed
scottgigante-immunai opened this issue Dec 17, 2022 · 0 comments · Fixed by #770
Closed

[Batch integration] Isolated labels broken for pancreas data #757

scottgigante-immunai opened this issue Dec 17, 2022 · 0 comments · Fixed by #770
Assignees
Labels
bug Something isn't working

Comments

@scottgigante-immunai
Copy link
Collaborator

The only isolated label is t_cell which only has seven cells. This doesn't seem like a good metric.

>>> import openproblems
>>> adata = openproblems.tasks.batch_integration_embed.datasets.pancreas_batch()
>>> from scib.metrics.isolated_labels import *
>>> isolated_labels = get_isolated_labels(
...         adata, label_key="labels", batch_key="batch", iso_threshold=None, verbose=True
...     )
isolated labels: no more than 4 batches per label
>>> isolated_labels
['t_cell']
>>> adata.obs['labels'].value_counts()
alpha                 5493
beta                  4169
ductal                2142
acinar                1669
delta                 1055
gamma                  699
activated_stellate     464
endothelial            313
quiescent_stellate     193
macrophage              79
mast                    42
epsilon                 32
schwann                 25
t_cell                   7
Name: labels, dtype: int64
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants