-
Notifications
You must be signed in to change notification settings - Fork 104
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Indexer / Clustering hyperparameter tuning #238
Comments
Also Can you help me point me difference between preliminary_indexer_params and refined_indexer_params? |
XR-Transformer will construct two HLTs:
|
For the document on hyper parameters see doc. |
Is it correct to under sparse meaning tf-idf and dense is more of word embeddings/vectors? My precision at 5 is in 60s and recall is 80s, so my understanding is ranker params need to be tuned to improve precision, is it correct understanding?
Do you mean Machine learning matcher by encoder or the encoder model that encodes the text? Also, what would be recommendation on playing with right splits for clustering for matcher and ranker? what would be good way to debug precision errors like printing cluster? Is it possible test ranker alone programmatically or indexer/matcher/ranker separately for understanding to tune? |
Hello,
Thank you for wonderful work. I am trying to train PECOS on a custom dataset and would like to understand indexer (example parameters below) hyperparameters. Is there any description of the hyperparameters one can explore? Also, it will be nice to understand each parameter.
train_params.preliminary_indexer_params.max_leaf_size = 380
train_params.preliminary_indexer_params.nr_splits = 2
train_params.refined_indexer_params.nr_splits = 2
train_params.ranker_params.nr_splits = 2
The text was updated successfully, but these errors were encountered: