Commit 0ad4722

fix: lower steps in smolvlm nightly test (#1239)
Signed-off-by: Terry Kong <[email protected]>
1 parent d82ca75 commit 0ad4722

3 files changed: +5 -5 lines changed
examples/configs/recipes/vlm/vlm_grpo-smolvlm2-2.2b-instruct-clevr-1n2g-dtensor2tp1.v1.yaml.disabled renamed to examples/configs/recipes/vlm/vlm_grpo-smolvlm2-2.2b-instruct-clevr-1n2g-dtensor2tp1.v2.yaml.disabled

File renamed without changes.

tests/test_suites/nightly.txt

Lines changed: 1 addition & 1 deletion
@@ -24,7 +24,7 @@ tests/test_suites/vlm/vlm_grpo-qwen2.5-vl-3b-instruct-clevr-1n2g-dtensor2tp1.v1.
 tests/test_suites/vlm/vlm_grpo-qwen2.5-vl-3b-instruct-clevr-1n2g-megatrontp2.v1.sh
 
 # Removing this until this issue is resolved: https://github.com/huggingface/transformers/issues/41190
-# tests/test_suites/vlm/vlm_grpo-smolvlm2-2.2b-instruct-clevr-1n2g-dtensor2tp1.v1.sh
+# tests/test_suites/vlm/vlm_grpo-smolvlm2-2.2b-instruct-clevr-1n2g-dtensor2tp1.v2.sh
 
 # Deepscaler (short tests)
 tests/test_suites/llm/grpo-deepscaler-1.5b-16K.sh

tests/test_suites/vlm/vlm_grpo-smolvlm2-2.2b-instruct-clevr-1n2g-dtensor2tp1.v1.sh.disabled renamed to tests/test_suites/vlm/vlm_grpo-smolvlm2-2.2b-instruct-clevr-1n2g-dtensor2tp1.v2.sh.disabled

Lines changed: 4 additions & 4 deletions
@@ -4,8 +4,8 @@ source $SCRIPT_DIR/common.env
 
 # ===== BEGIN CONFIG =====
 NUM_NODES=1
-STEPS_PER_RUN=200
-MAX_STEPS=200
+STEPS_PER_RUN=130
+MAX_STEPS=130
 NUM_RUNS=$(( (MAX_STEPS + STEPS_PER_RUN - 1) / STEPS_PER_RUN )) # Round up
 NUM_MINUTES=180
 # ===== END CONFIG =====
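
Note on the hunk above: `NUM_RUNS` is computed with the usual integer round-up idiom, so lowering `STEPS_PER_RUN` and `MAX_STEPS` together keeps the test at a single run. A minimal Python sketch of the same arithmetic follows; the helper name `num_runs` is hypothetical and not part of the repository.

```python
def num_runs(max_steps: int, steps_per_run: int) -> int:
    """Ceiling division, mirroring
    NUM_RUNS=$(( (MAX_STEPS + STEPS_PER_RUN - 1) / STEPS_PER_RUN )).
    """
    if steps_per_run <= 0:
        raise ValueError("steps_per_run must be positive")
    # Adding (steps_per_run - 1) before floor division rounds the quotient up.
    return (max_steps + steps_per_run - 1) // steps_per_run


# Values after this commit: one run covers all 130 steps.
print(num_runs(130, 130))
# If only STEPS_PER_RUN had been lowered, a second run would be needed.
print(num_runs(200, 130))
```

Changing both variables in the same commit avoids a partial second run.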
@@ -34,7 +34,7 @@ uv run tests/json_dump_tb_logs.py $LOG_DIR --output_path $JSON_METRICS
 # Only run metrics if the target step is reached
 if [[ $(jq 'to_entries | .[] | select(.key == "train/loss") | .value | keys | map(tonumber) | max' $JSON_METRICS) -ge $MAX_STEPS ]]; then
     uv run tests/check_metrics.py $JSON_METRICS \
-        'data["train/loss"]["200"] < 0.1' \
-        'data["train/reward"]["200"] > 0.7' # less performant than qwen
+        'data["train/loss"]["130"] < 0.1' \
+        'mean(data["train/reward"], -6, -1) > 0.6' # less performant than qwen
 fi
 
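
Note on the hunk above: the reward check moves from a single step (`["200"] > 0.7`) to a mean over a trailing window with a lower threshold (`> 0.6`). The sketch below illustrates why a windowed check is less sensitive to one noisy step. It is an assumption-laden illustration: `max_logged_step` and `trailing_mean` are hypothetical helpers, the exact semantics of the repository's `mean(..., -6, -1)` (for example whether the end of the range is inclusive) are not shown in this diff, and the reward values are made up for the example, not results from this test.

```python
def max_logged_step(series: dict) -> int:
    """Largest step key, like the jq filter `keys | map(tonumber) | max`."""
    return max(int(step) for step in series)


def trailing_mean(series: dict, last_n: int) -> float:
    """Mean of the values at the last `last_n` logged steps (hypothetical helper)."""
    if last_n <= 0 or not series:
        raise ValueError("need a non-empty series and a positive window")
    # Sort keys numerically; JSON keys are strings, so "130" must sort after "99".
    steps = sorted(series, key=int)[-last_n:]
    return sum(series[step] for step in steps) / len(steps)


# Made-up values for illustration only.
reward = {"125": 0.62, "126": 0.66, "127": 0.61,
          "128": 0.68, "129": 0.64, "130": 0.55}
MAX_STEPS = 130

# Gate the checks on the target step having been reached, as the script does.
if max_logged_step(reward) >= MAX_STEPS:
    single_step_ok = reward["130"] > 0.6          # depends on one noisy value
    windowed_ok = trailing_mean(reward, 6) > 0.6  # averages out the dip
    print(single_step_ok, windowed_ok)
```

With these illustrative numbers the final step dips below the threshold while the six-step average stays above it, which is the kind of flakiness a windowed check is meant to absorb.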
