train & eval scripts #26

Merged
merged 5 commits into from
Nov 26, 2024

Conversation

lannelin
Collaborator

No description provided.

@lannelin
Collaborator Author

Hi @eddableheath @J-Dymond. I've run this locally for 2 epochs to train a model that gets zero-one acc: 0.23 and hamming acc: 0.92 on the validation set (though without the seed set, unfortunately :( ).
Let me know if you'd like to review and then run on Baskerville - possibly with slightly tweaked hparams - or would prefer that I or one of you run this on Baskerville first.
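
For anyone comparing the two numbers above, here is a minimal sketch of how they can differ so much. It assumes the task is multi-label topic classification, that "zero one acc" means exact-match accuracy over the whole label vector, and that "hamming acc" means the fraction of individual label decisions that are correct. The function names and the toy data are hypothetical and are not taken from the scripts in this PR.

```python
from typing import Sequence


def zero_one_accuracy(
    y_true: Sequence[Sequence[int]], y_pred: Sequence[Sequence[int]]
) -> float:
    """Fraction of examples whose entire label vector is predicted exactly."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("y_true and y_pred must be non-empty and equal length")
    exact = sum(1 for t, p in zip(y_true, y_pred) if list(t) == list(p))
    return exact / len(y_true)


def hamming_accuracy(
    y_true: Sequence[Sequence[int]], y_pred: Sequence[Sequence[int]]
) -> float:
    """Fraction of individual label slots that are predicted correctly."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("y_true and y_pred must be non-empty and equal length")
    correct = 0
    total = 0
    for t, p in zip(y_true, y_pred):
        if len(t) != len(p):
            raise ValueError("each example needs the same number of labels")
        correct += sum(1 for a, b in zip(t, p) if a == b)
        total += len(t)
    return correct / total


# Toy multi-hot labels (hypothetical): 3 examples, 4 topics each.
y_true = [[1, 0, 0, 1], [0, 1, 0, 0], [1, 1, 0, 0]]
y_pred = [[1, 0, 0, 1], [0, 1, 1, 0], [0, 1, 0, 0]]

print(f"zero-one acc: {zero_one_accuracy(y_true, y_pred):.2f}")
print(f"hamming acc:  {hamming_accuracy(y_true, y_pred):.2f}")
```

Under these assumptions a single wrong label makes the whole example count as a miss for zero-one accuracy, while Hamming accuracy only loses one slot, so a low zero-one score alongside a high Hamming score is expected when there are many topics per example.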

Collaborator

@J-Dymond J-Dymond left a comment

Looks good! I think it would be a good idea to train it for longer on Baskerville. This performs better than zero-shot, as I'm sure we expected, so I can work on getting this into the current pipelines. We also have the more difficult levels of topic label that we can use, though we could probably move to training on those labels at a later stage?

@lannelin
Collaborator Author

This has now run on Baskerville, and the trained model is available in the project dir under distilbert_topic_classifier/.
I've made a couple of small changes in the latest commits to fix the README and to allow loading only a single split of the data.

@lannelin lannelin requested a review from J-Dymond November 26, 2024 11:33
@lannelin
Collaborator Author

closes #23

Collaborator

@J-Dymond J-Dymond left a comment

These changes look good to me. I'll merge them into the inference pipeline branch before I open a pull request.

@lannelin lannelin merged commit 419add8 into main Nov 26, 2024
5 checks passed
@lannelin lannelin deleted the 23-train-topic-classifier branch November 29, 2024 16:57
2 participants