
Conversation

@michael-L-i

Code tailored to SFT training on Dipam's dataset using Qwen3-1.7B. The Colab closely mirrors the existing Qwen SFT example, adjusted for our chess case.

After installing dependencies, simply run the same torchrun command, pointing it at the finetune_chess_model.py file. The train/eval datasets can be configured in that file.
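For reference, a minimal sketch of the launch command. The script name comes from this PR; the flag value is an assumption (a trn1.2xlarge exposes 2 NeuronCores), so adjust it for your instance:

```bash
# Sketch only: --nproc_per_node=2 assumes the 2 NeuronCores of a trn1.2xlarge.
torchrun --nproc_per_node=2 finetune_chess_model.py
```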

I verified this pipeline by training on a small 5,000-position dataset on a trn1.2xlarge instance, with reasonably good results already.

@EmilyWebber

Awesome, thanks Mike!

@EmilyWebber requested a review from jlonge4 on November 18, 2025 at 17:56

@jlonge4 left a comment


The one issue is that max_model_len should match between compilation and inference (512 != 2048). Otherwise LGTM!
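To keep the two values in lockstep, one option is to fail fast at load time. Here is a minimal Python sketch with hypothetical names (nothing below is code from this PR):

```python
# guard_model_len.py -- hypothetical sketch, not code from this PR.
# Fail fast if the sequence length used at compile time and the one
# passed to the inference runtime disagree (e.g. 512 != 2048).

COMPILED_MAX_MODEL_LEN = 2048   # value the Neuron artifact was compiled with
INFERENCE_MAX_MODEL_LEN = 2048  # value handed to the inference runtime

def check_model_len(compiled: int, inference: int) -> None:
    """Raise before serving if the two max_model_len values differ."""
    if compiled != inference:
        raise ValueError(
            f"max_model_len mismatch: compiled with {compiled}, "
            f"serving with {inference}; these must match."
        )

check_model_len(COMPILED_MAX_MODEL_LEN, INFERENCE_MAX_MODEL_LEN)
```

The simpler fix, of course, is a single shared constant imported by both the compilation and inference scripts, so the two values cannot drift apart.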


@EmilyWebber left a comment


Hey Mike - we need the max_model_len from compilation and inference to match (512 != 2048).

@michael-L-i changed the title from "init code for SFT on Dipam's dataset formatted for model input" to "SFT pipeline integrated for aicrowd/ChessExplained dataset" on Nov 19, 2025
@EmilyWebber merged commit b95f725 into aws-neuron:main on Nov 20, 2025