Merged
30 commits
87e6e95
migrate ICPC changes to IOI
SeanNaren Nov 18, 2025
bafb942
update IOI
SeanNaren Nov 19, 2025
52294b8
update IOI
SeanNaren Nov 19, 2025
96ad261
update IOI
SeanNaren Nov 19, 2025
d6247db
Merge branch 'main' into feat/update_ioi
SeanNaren Nov 20, 2025
df13f48
Merge branch 'main' into feat/update_ioi
SeanNaren Nov 21, 2025
d1d154a
Merge branch 'main' into feat/update_ioi
SeanNaren Nov 22, 2025
1dd2359
add input test support
SeanNaren Nov 22, 2025
b1e5515
Merge branch 'feat/update_ioi' of https://github.com/NVIDIA-NeMo/Skil…
SeanNaren Nov 22, 2025
868b070
update IOI
SeanNaren Nov 25, 2025
89773fe
round scores
SeanNaren Nov 26, 2025
cf643f8
fix naming
SeanNaren Nov 26, 2025
3c9d7b6
fix naming
SeanNaren Nov 26, 2025
3fa1c7b
update
SeanNaren Nov 27, 2025
62b82b3
update
SeanNaren Nov 27, 2025
b39bfd5
update
SeanNaren Nov 27, 2025
d432370
update
SeanNaren Dec 1, 2025
0f8f69a
update
SeanNaren Dec 1, 2025
65c08bb
update
SeanNaren Dec 1, 2025
d82fdc7
update
SeanNaren Dec 2, 2025
27505a5
update
SeanNaren Dec 3, 2025
90f6044
Merge branch 'main' into feat/update_ioi
SeanNaren Dec 4, 2025
f654d27
Merge branch 'main' into feat/update_ioi
SeanNaren Dec 14, 2025
e8f7f6b
update ioi text
SeanNaren Dec 14, 2025
0412a44
update clustering logic in metric
SeanNaren Dec 15, 2025
6d1d588
Merge branch 'main' into feat/update_ioi
SeanNaren Dec 16, 2025
520848a
add sha back
SeanNaren Dec 16, 2025
725f6a7
Merge branch 'feat/update_ioi' of https://github.com/NVIDIA/NeMo-Skil…
SeanNaren Dec 16, 2025
a12e07f
Merge branch 'main' into feat/update_ioi
mehrzads Dec 16, 2025
8c864ed
Merge branch 'main' into feat/update_ioi
SeanNaren Dec 17, 2025
5 changes: 4 additions & 1 deletion .github/workflows/tests.yml
@@ -85,7 +85,10 @@ jobs:
 NVIDIA_API_KEY: ${{ secrets.NVIDIA_API_KEY }}
 HF_TOKEN: ${{ secrets.HF_TOKEN }}
 run: |
-docker run --rm --network=host nemo-skills-sandbox-image &
+# Default shared runtime directory
+sudo mkdir -p /nemo_run
+sudo chmod 777 /nemo_run
+docker run --rm --network=host -v /nemo_run:/nemo_run nemo-skills-sandbox-image &
 sleep 10
 set -o pipefail # this will make sure next line returns non-0 exit code if tests fail
 ns prepare_data gsm8k math-500
18 changes: 9 additions & 9 deletions docs/evaluation/code.md
@@ -185,10 +185,10 @@ We currently support IOI24 and are working to support IOI25 for evaluation. The

 #### Data Preparation

-First, prepare the dataset by running the `ns prepare_data` command. The arguments below will generate `test.jsonl` and `test_metadata.json`.
+First, prepare the dataset by running the `ns prepare_data` command. The arguments below will generate `ioi24.jsonl` and `ioi24_metadata.json`.

 ```
-ns prepare_data ioi24
+ns prepare_data ioi
 ```

 #### Running the Evaluation
@@ -209,24 +209,24 @@ ns eval \
 --server_gpus=8 \
 --benchmarks=ioi24:50 \
 --with_sandbox \
---split=test \
+--split=ioi24 \
 --data_dir=<DATA_DIR> \
 --output_dir=<OUTPUT_DIR> \
---extra_eval_args="++eval_config.test_file=<PATH_TO_METADATA_TEST_FILE>" \
+--eval_subfolder=eval-results/ioi24/ \ # set the folder if you want to differentiate subsets.
+--extra_eval_args="++eval_config.test_file=<PATH_TO_METADATA_TEST_DIR>/ioi24_metadata.json" \
 ++inference.temperature=0.6 \
 ++inference.top_p=0.95 \
 ++inference.tokens_to_generate=65536
 ```

 ##### Verifying Results

-After all jobs are complete, you can check the results in `<OUTPUT_DIR>/eval-results/ioi24/metrics.json`. You can also take a look at `<OUTPUT_DIR>/eval-results/ioi24/summarized-results/main_*`. They should look something like this:
+After all jobs are complete, you can check the results in `<OUTPUT_DIR>/eval-results/ioi24/ioi/metrics.json`. You can also take a look at `<OUTPUT_DIR>/eval-results/ioi24/ioi/summarized-results/main_*`. They should look something like this:

 ```
------------------------------------------------------- ioi24 ------------------------------------------------------
-evaluation_mode | num_entries | avg_tokens | gen_seconds | correct | total_score | round_robin_score
-pass@1[avg-of-50] | 39 | 40387 | 7410 | 0.51% ± 1.04% | 303.47 | 261.01
-pass@50 | 39 | 40387 | 7410 | 2.56% | 303.47 | 261.01
+------------------------------------ ioi24 -------------------------------------
+evaluation_mode | num_entries | avg_tokens | gen_seconds | correct | total_score
+pass@50 | 39 | 52225 | 99630 | 23.08% | 500
 ```

 ### livecodebench
@@ -27,6 +27,7 @@
 if __name__ == "__main__":
 parser = argparse.ArgumentParser()
 parser.add_argument("--split", type=str, default="test")
+parser.add_argument("--suffix", type=str, default="24")
 args = parser.parse_args()

 data_dir = Path(__file__).absolute().parent
@@ -50,7 +51,7 @@
 }
 )

-with open(os.path.join(data_dir, f"{args.split}.jsonl"), "w") as f:
+with open(os.path.join(data_dir, f"ioi{args.suffix}.jsonl"), "w") as f:
 f.write("\n".join(json.dumps(x) for x in entries))

 tests_dataset = load_dataset("open-r1/ioi-test-cases", name="2024", split="train")
@@ -82,5 +83,5 @@
 "grader_files": entry["grader_files"],
 }

-with open(os.path.join(data_dir, f"{args.split}_metadata.json"), "w") as f:
+with open(os.path.join(data_dir, f"ioi{args.suffix}_metadata.json"), "w") as f:
 json.dump(final_structure, f)
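
Note on the renamed outputs: the hunks above switch the output file names from `{split}.jsonl` and `{split}_metadata.json` to `ioi{suffix}.jsonl` and `ioi{suffix}_metadata.json`. The sketch below mirrors that naming and shows one way the entries file could be read back. The helpers `output_paths` and `load_jsonl` are hypothetical names introduced for illustration and are not part of this PR, and the placeholder records stand in for the real dataset fields.

```python
import json
import tempfile
from pathlib import Path


def output_paths(data_dir, suffix="24"):
    """Return (entries_path, metadata_path) using the naming from the diff.

    Hypothetical helper: the PR builds these paths inline with os.path.join.
    """
    data_dir = Path(data_dir)
    return data_dir / f"ioi{suffix}.jsonl", data_dir / f"ioi{suffix}_metadata.json"


def load_jsonl(path):
    """Read one JSON object per non-empty line.

    Skipping blank lines keeps this tolerant of a missing or extra trailing
    newline; the script in the diff writes no trailing newline.
    """
    with open(path) as f:
        return [json.loads(line) for line in f if line.strip()]


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        entries_path, metadata_path = output_paths(tmp)

        # Placeholder records, assumed for illustration only.
        entries = [{"id": 1}, {"id": 2}]

        # Same write pattern as the diff: newline-joined JSON objects.
        with open(entries_path, "w") as f:
            f.write("\n".join(json.dumps(x) for x in entries))
        with open(metadata_path, "w") as f:
            json.dump({"example": {}}, f)

        assert load_jsonl(entries_path) == entries
        print(entries_path.name, metadata_path.name)
```

With the default suffix this resolves to `ioi24.jsonl` and `ioi24_metadata.json`, matching the names used in the updated `docs/evaluation/code.md`.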
31 changes: 0 additions & 31 deletions nemo_skills/dataset/ioi25/__init__.py

This file was deleted.
