-
Notifications
You must be signed in to change notification settings - Fork 358
Nathan add logging to metrics #157
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
However, do you have an example of a parquet file saved with this new method? Under which key are these added info stored?
yeah i do, in the detail file, they are added in the
|
Hm would there be a way to put them elsewhere? Like in an "additional info" field? |
i don't really see why, that's where i wanted to put them in the first place. judgement prompt is tightly related to the metric |
What else does the metric field contain in your above file? |
nothing, the only metric we are using for mt_bench is llm_as_judge |
Ok, that works for now! If we use this system more in the future for extra logging, we might need to move some values to another field of details |
what this PR does: If you want to log out something comming from the metrics, simply return it in the metric dict. for example, if you want to log out the judge response when using llm_as_judge, simply return the response in the dict. ``` { "score": score, "judgement": judge_response } ```` the `judgement` field is a string and will not be aggregated. however, it will be logged in the details for each sample. --------- Co-authored-by: Nathan Habib <[email protected]>
what this PR does: If you want to log out something comming from the metrics, simply return it in the metric dict. for example, if you want to log out the judge response when using llm_as_judge, simply return the response in the dict. ``` { "score": score, "judgement": judge_response } ```` the `judgement` field is a string and will not be aggregated. however, it will be logged in the details for each sample. --------- Co-authored-by: Nathan Habib <[email protected]>
what this PR does:
If you want to log out something comming from the metrics, simply return it in the metric dict.
for example, if you want to log out the judge response when using llm_as_judge, simply return the response in the dict.
the
judgement
field is a string and will not be aggregated. however, it will be logged in the details for each sample.