-
Notifications
You must be signed in to change notification settings - Fork 367
Add extended task for LiveCodeBench codegeneration #548
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
23 commits
Select commit
Hold shift + click to select a range
9369a37
Add draft for livecodebench code generation
plaguss 2001e7b
Add extra argument version_tag
plaguss fece552
Fix import name
plaguss e46fc2a
Remove unused typed dict
plaguss 6a3c007
Checkpoint, not ready yet, try simplifying code running and reuse pas…
plaguss 987eb2a
Add notes for expected values
plaguss 42fb0f5
Pass version tag to downloader
plaguss b700dc4
Modify helper module and remove dataset version tag
plaguss 29b2bbe
Remove version_tag
plaguss a60e662
Initial version for lcb:codegeneration
plaguss 05a7f01
Remove outdated argument docs
plaguss deea663
Remove hardcoded system prompt and pass it via arg
plaguss a2863f9
Merge branch 'main' into lcb-codegeneration
plaguss 44f45b5
Add kwargs to allow passing other arguments
plaguss 127b4cd
Make generic function to parse the metric name and obtain the number …
plaguss a372e05
Change metric name to make it more informative
plaguss 53ab417
Add experimental way of passing the number of samples for a metric fr…
plaguss f6a7c4f
Add more processes to run the tests
plaguss d6abcd0
Allow reading the generation parameters from the CLI
plaguss 158d660
Update parsing arguments from CLI
plaguss 54fa032
Remove dead code and fix test value
plaguss 4a0fe89
Fix num_samples update
plaguss f945fdf
Add docs for the new metric_options
plaguss File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
can you add some docs for this ?