working with custom input data #38

Open
zeydabadi opened this issue Dec 8, 2022 · 3 comments

Comments

@zeydabadi

Hi,
Suppose that the input data consists of 10000 images, each of size MxN.
For the end-to-end feature extraction and clustering scenario, what should the size of the train.pt tensor be?

Also, how can we provide labels for custom data? Let's say the possible labels are "positive", "negative", and "unknown".

Thanks

@meitarronen
Contributor

Hi, if you are using the end-to-end feature extraction format, train.pt should be of size 10000 x (M*N), i.e., one flattened image per row (e.g., if M = 10 and N = 20, the tensor is 10000 x 200).
You don't need to provide labels; this is an unsupervised method :)
Simply supply train_data.pt and test_data.pt.
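
A minimal sketch of the shape described above, assuming the images are already available as a PyTorch tensor of shape (10000, M, N). The random data, the temporary output directory, and the choice of the `train_data.pt` file name (taken from the last line of the reply) are illustrative assumptions; check the repository's README for where the files are expected to live.

```python
import os
import tempfile

import torch

num_images, M, N = 10000, 10, 20  # example sizes from the reply above

# Stand-in for real data: one M x N image per entry along the first axis.
images = torch.rand(num_images, M, N)

# Flatten each image into a single row of length M * N.
train_data = images.reshape(num_images, M * N)

# Save the tensor; the directory here is a placeholder, not a path the repo requires.
out_dir = tempfile.mkdtemp()
train_path = os.path.join(out_dir, "train_data.pt")
torch.save(train_data, train_path)

# Reload to confirm the saved tensor has the expected 2-D shape.
reloaded = torch.load(train_path)
print(reloaded.shape)  # torch.Size([10000, 200])
```

The same flattening would apply to the test set before saving it as test_data.pt.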

@zeydabadi
Author

Thanks for your response. I asked about labels so I could evaluate the clustering outcomes. How would you evaluate your end-to-end feature extraction and clustering algorithm if you don't have labels?

@meitarronen
Contributor

Hey, you can look at our paper for a more detailed answer, but in short: our algorithm is intended for cases where labels are not available (i.e., an unsupervised task). There are common unsupervised metrics in the literature, such as the silhouette score, and many more.
If you have labels, you can use them for evaluation with the --use_labels_for_eval flag (as we did in our paper when comparing on labeled datasets).
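
To make the two evaluation routes concrete, here is a rough sketch using scikit-learn rather than anything from this repository. The toy features, the KMeans call standing in for the model's cluster assignments, and the string-to-integer label mapping are all assumptions for illustration; how the repository expects label files to be stored is not covered in this thread.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.metrics import normalized_mutual_info_score, silhouette_score

rng = np.random.default_rng(0)

# Toy features: three well-separated 2-D blobs of 50 points each.
centers = np.array([[0.0, 0.0], [10.0, 10.0], [-10.0, 10.0]])
features = np.vstack([c + rng.normal(scale=0.5, size=(50, 2)) for c in centers])

# Stand-in for the cluster assignments a clustering model would produce.
pred = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(features)

# Without labels: the silhouette score needs only features and assignments.
sil = silhouette_score(features, pred)

# With labels: map string labels to integer ids, then compare to the assignments.
label_names = ["positive", "negative", "unknown"]
label_to_id = {name: i for i, name in enumerate(label_names)}
string_labels = ["positive"] * 50 + ["negative"] * 50 + ["unknown"] * 50
true = np.array([label_to_id[s] for s in string_labels])
nmi = normalized_mutual_info_score(true, pred)

print(f"silhouette={sil:.3f}, NMI={nmi:.3f}")
```

Normalized mutual information does not depend on which integer each cluster or class happens to receive, so the arbitrary ids in `label_to_id` do not affect the result.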
