feat: add gradient accumulation support #2646
Conversation
Pull request overview
This PR adds gradient accumulation support to the training pipeline, enabling simulation of larger batch sizes without increasing GPU memory usage. The implementation leverages Accelerator's built-in gradient accumulation features.
Key changes:
- Added `gradient_accumulation_steps` configuration parameter (default: 1) to control how many batches to accumulate before performing an optimizer step
- Updated training loop to properly handle gradient synchronization, skipping evaluation/checkpointing during accumulation steps
- Modified effective batch size calculations throughout the codebase to account for gradient accumulation
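The accumulation control flow described above can be sketched in plain Python. This is a minimal stdlib simulation of the logic, not lerobot's actual training loop; the function and variable names are illustrative:

```python
# Minimal simulation of gradient-accumulation control flow (illustrative only,
# not lerobot's actual code). An optimizer step fires only when gradients sync,
# i.e. every `accum_steps` batches; eval/checkpointing is skipped otherwise.
def simulate_training(total_batches: int, accum_steps: int):
    optimizer_steps = 0
    eval_points = []
    for batch_idx in range(total_batches):
        sync_gradients = (batch_idx + 1) % accum_steps == 0
        if sync_gradients:
            optimizer_steps += 1           # optimizer.step() + zero_grad()
            eval_points.append(batch_idx)  # eval/checkpoint allowed here
        # during accumulation-only steps, just backward() runs
    return optimizer_steps, eval_points

steps, evals = simulate_training(total_batches=8, accum_steps=4)
print(steps, evals)  # 2 optimizer steps, after batches 3 and 7
```

With 8 batches and 4 accumulation steps, only 2 optimizer steps occur, which is why evaluation and checkpointing must key off the sync boundary rather than the raw batch index.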
Reviewed changes
Copilot reviewed 5 out of 5 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| src/lerobot/configs/train.py | Adds gradient_accumulation_steps configuration parameter with documentation |
| src/lerobot/scripts/lerobot_train.py | Updates training loop and update_policy() to use Accelerator's accumulation context, removes unused lock parameter, and adds gradient sync checks |
| src/lerobot/utils/logging_utils.py | Updates MetricsTracker.step() to calculate effective batch size including gradient accumulation |
| tests/training/test_update_policy.py | Adds comprehensive tests for gradient sync behavior and mathematical equivalence |
| tests/utils/test_logging_utils.py | Adds test for MetricsTracker with gradient accumulation |
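The effective batch size bookkeeping mentioned for `MetricsTracker.step()` reduces to a simple product. A hypothetical helper (not the actual `MetricsTracker` code) shows the arithmetic:

```python
# Effective batch size with gradient accumulation (illustrative helper, not
# the real MetricsTracker implementation): per-device batch size, times the
# number of processes, times the number of accumulation steps.
def effective_batch_size(batch_size: int, num_processes: int, accum_steps: int) -> int:
    return batch_size * num_processes * accum_steps

print(effective_batch_size(8, 1, 4))  # 32
```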
Remove lock parameter and related changes that are outside the scope of gradient accumulation feature. This keeps the branch focused on its primary topic and reduces unnecessary diff from main.
Hey @irisTa56, that's a very useful feature! I did this locally to train bigger models. Could you maybe resolve the conflicts with main, so we can kick off a review?

@jadechoghari
Thanks @irisTa56! The tests in this PR seem unnecessary since we're using the accelerate API, and testing this functionality is the responsibility of accelerate itself. Removing the accelerate-related tests would make the PR cleaner and shorter 😄
Thanks for the feedback! I agree and have removed the tests. |
What this does
Adds gradient accumulation support to the training script.
This allows simulating larger batch sizes without increasing GPU memory usage, which is useful when training large models or when memory is limited.
- Added `gradient_accumulation_steps` configuration parameter to `TrainPipelineConfig` (default: 1)
- Updated `update_policy()` to use the `accelerator.accumulate()` context manager
- Updated `MetricsTracker.step()` to account for gradient accumulation (each `step` counted as an optimizer step)

How it was tested
Added tests in `tests/training/test_update_policy.py`:
- `test_update_policy_sync_gradients`: Verifies gradient sync behavior
- `test_update_policy_gradient_accumulation_equivalence`: Validates mathematical equivalence
- `test_metrics_tracker_step_with_accelerator` in `tests/utils/test_logging_utils.py`

P.S. I have removed those tests since we're using the Accelerate API, and testing it is out of the scope of this PR. But they can be seen at d5df208 and d365eb8.
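The equivalence property the removed test checked can also be verified by hand: summing per-microbatch mean gradients, each divided by the number of accumulation steps, equals the full-batch mean gradient when microbatches are equal-sized. A stdlib sketch for a one-parameter linear model with MSE loss (purely illustrative; the removed test exercised the real policy and optimizer):

```python
import math

# Hand-check of gradient-accumulation equivalence for MSE on y_hat = w * x.
# dL/dw for one sample is 2 * (w*x - y) * x; the full-batch gradient is
# the mean over samples.
def grad(w, xs, ys):
    return sum(2 * (w * x - y) * x for x, y in zip(xs, ys)) / len(xs)

w = 0.5
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]

full = grad(w, xs, ys)

# Accumulate over 2 equal-sized microbatches: each microbatch gradient is
# divided by the number of accumulation steps, then summed.
accum = grad(w, xs[:2], ys[:2]) / 2 + grad(w, xs[2:], ys[2:]) / 2

assert math.isclose(full, accum)  # identical up to floating-point error
```

This loss-scaling convention (dividing each microbatch loss, and hence its gradient, by the accumulation step count) is what Accelerate applies inside `accelerator.accumulate()`.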
How to checkout & try? (for the reviewer)
```shell
# Effective batch size: 8 × 1 × 4 = 32
lerobot-train \
  --dataset.repo_id=lerobot/svla_so101_pickplace \
  --policy.type=act \
  --policy.repo_id=foo/my_policy \
  --policy.push_to_hub=false \
  --batch_size=8 \
  --gradient_accumulation_steps=4 \
  --log_freq=1 \
  --steps=50
```