Prepare for transformers v5 upgrade#3272

Merged
winglian merged 47 commits into main from transformers-v5 on Jan 27, 2026

Conversation

winglian (Collaborator) commented Nov 20, 2025

Summary by CodeRabbit

  • Chores
    • Expanded build pipeline support for new dependency versions: CUDA 12.8.1, Python 3.11, PyTorch 2.9.0, and PyTorch 2.9.1. Updated both standard and cloud build configurations.

coderabbitai (bot, Contributor) commented Nov 20, 2025

Important

Review skipped

Auto incremental reviews are disabled on this repository.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Walkthrough

Modifies .github/workflows/main.yml to add matrix entries for CUDA 12.8.1 configurations to both build-axolotl and build-axolotl-cloud jobs, introducing two PyTorch versions (2.9.0 and 2.9.1) with Python 3.11.

Changes

| Cohort / File(s) | Summary |
|---|---|
| Workflow Matrix Configuration: .github/workflows/main.yml | Adds two new matrix entries (PyTorch 2.9.0 and 2.9.1) with CUDA 12.8.1 and Python 3.11 to both build-axolotl and build-axolotl-cloud jobs |
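
As a rough illustration of the matrix change summarized above, the sketch below enumerates the two new combinations in Python. The dictionary keys (cuda_version, python_version, pytorch) are hypothetical names chosen for this example; the actual keys in .github/workflows/main.yml may differ.

```python
from itertools import product

# Versions taken from the walkthrough above.
CUDA_VERSIONS = ["12.8.1"]
PYTHON_VERSIONS = ["3.11"]
PYTORCH_VERSIONS = ["2.9.0", "2.9.1"]


def build_matrix_entries():
    """Return one entry per (CUDA, Python, PyTorch) combination.

    The key names are assumptions made for illustration only.
    """
    return [
        {"cuda_version": cuda, "python_version": py, "pytorch": torch}
        for cuda, py, torch in product(
            CUDA_VERSIONS, PYTHON_VERSIONS, PYTORCH_VERSIONS
        )
    ]


if __name__ == "__main__":
    for entry in build_matrix_entries():
        print(entry)
```

With one CUDA version and one Python version, the product yields exactly two entries, one per PyTorch release, which would be added to each of the two build jobs.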

Estimated Code Review Effort

🎯 1 (Trivial) | ⏱️ ~3 minutes

  • Simple configuration matrix additions with no logic changes
  • Repetitive, homogeneous pattern across both jobs
  • No structural or behavioral modifications to workflow logic

Possibly Related PRs

  • #2784: Modifies the same GitHub Actions workflow matrices for CUDA 12.8.1 PyTorch configurations in build jobs
  • #3049: Updates GitHub Actions workflow matrix in .github/workflows/main.yml for the build-axolotl-cloud job
  • #3221: Adds matching CUDA 12.8.1 / Python 3.11 / PyTorch 2.9.0 matrix entries to CI workflow configurations

Suggested Reviewers

  • NanoCode012
  • SalmanMohammadi

Pre-merge checks and finishing touches

❌ Failed checks (1 warning)

| Check name | Status | Explanation | Resolution |
|---|---|---|---|
| Title check | ⚠️ Warning | The PR title 'Prepare for transformers v5 upgrade' does not match the actual changes, which only add CUDA 12.8.1 and PyTorch 2.9.x matrix entries to CI workflows. | Update the title to accurately reflect the changes, such as 'Add CUDA 12.8.1 and PyTorch 2.9.x matrix entries to build workflows' or clarify how these CI changes relate to transformers v5 preparation. |

✅ Passed checks (2 passed)

| Check name | Status | Explanation |
|---|---|---|
| Description Check | ✅ Passed | Check skipped - CodeRabbit’s high-level summary is enabled. |
| Docstring Coverage | ✅ Passed | No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check. |

github-actions (bot, Contributor) commented Nov 20, 2025

📖 Documentation Preview: https://697908f5e23666577cd750fa--resonant-treacle-0fd729.netlify.app

Deployed on Netlify from commit 875ec43

Comment thread .github/workflows/tests.yml
Comment thread tests/hf_offline_utils.py
winglian force-pushed the transformers-v5 branch 4 times, most recently from 6ac8f13 to 9767e28, on December 3, 2025 13:29
Comment thread requirements.txt Outdated
  tokenizers>=0.22.1
- transformers==4.57.1
+ transformers @ git+https://github.com/huggingface/transformers.git@main
  accelerate==1.11.0

salmanmohammadi (Contributor) commented Dec 3, 2025

I think we need to bump the accelerate version @winglian
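
For context on the requirements.txt change in this thread: code that has to run against both the old 4.57.1 pin and a v5 build from main may need to branch on the major version. A minimal sketch using only the standard library follows; the helper names major_version and is_transformers_v5 are hypothetical and not part of axolotl or transformers.

```python
import re


def major_version(version: str) -> int:
    """Extract the leading major number from a version string like '4.57.1'."""
    match = re.match(r"\s*v?(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return int(match.group(1))


def is_transformers_v5(version: str) -> bool:
    """Return True when the given version string is 5.x or later."""
    return major_version(version) >= 5
```

In practice the version string could come from importlib.metadata.version("transformers"), which reads the installed distribution's metadata without importing the package.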

NanoCode012 (Collaborator) commented

Requires #3318

winglian force-pushed the transformers-v5 branch 2 times, most recently from 0341778 to c0d0fac, on December 31, 2025 03:11
winglian and others added 20 commits January 22, 2026 10:56
more fsdp_version fixes for v5
fix version in fsdp_config
fix aliasing
fix fsdp_version check
check fsdp_version is 2 in both places
* bump dep

* use latest fbgemm, grab model config as part of fixture, un-skip test

* import AutoConfig

* don't need more problematic autoconfig when specifying config.json manually

* add fixtures for argilla ultrafeedback datasets

* download phi4-reasoning

* fix arg

* update tests for phi fast tokenizer changes

* use explicit model types for gemma3

---------

Co-authored-by: Wing Lian <wing@axolotl.ai>
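
Several of the commit titles above mention checking that fsdp_version is 2 "in both places". The PR page does not show the code, so the following is only a sketch of what such a check could look like. It assumes the version may be set either as a top-level fsdp_version key or inside an fsdp_config mapping; both the layout and the function name validate_fsdp_version are hypothetical.

```python
def validate_fsdp_version(cfg: dict, required: int = 2) -> None:
    """Raise ValueError if any fsdp_version that is set differs from `required`.

    Assumed (hypothetical) layout: the version may appear as a top-level
    `fsdp_version` key and/or as `fsdp_version` inside `fsdp_config`.
    Locations where the key is absent are skipped.
    """
    locations = (
        ("fsdp_version", cfg.get("fsdp_version")),
        ("fsdp_config.fsdp_version", (cfg.get("fsdp_config") or {}).get("fsdp_version")),
    )
    for name, value in locations:
        if value is not None and int(value) != required:
            raise ValueError(f"{name} must be {required}, got {value!r}")
```

Checking both locations in one helper keeps the two checks from drifting apart, which appears to be the concern behind the "both places" commit.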
winglian merged commit fc4e379 into main on Jan 27, 2026
23 of 24 checks passed
winglian deleted the transformers-v5 branch on January 27, 2026 22:08

3 participants