refactor(processor): enhance tensor device handling in normalization processor by AdilZouitine · Pull Request #1795 · huggingface/lerobot

AdilZouitine · 2025-08-28T11:05:49Z

Issue and Fix Summary

The NormalizerProcessor had a critical device-compatibility issue that broke Accelerate and multi-GPU setups.
The original logic tried to move input tensors to a fixed processor device, which failed when the processor device was None or when Accelerate had placed tensors on specific GPUs (cuda:0, cuda:1, etc.). Because the normalization stats stayed on the CPU while the input tensors were on a GPU, tensor operations raised "Expected all tensors to be on the same device" errors.

The fix reverses this approach: instead of forcing input tensors onto the processor device, the processor now moves the stats tensors to the input tensor's device at call time, using the existing _convert_stats_to_tensors function. This preserves Accelerate's device placement decisions, keeps every tensor operation on a single device, and makes the processor device-agnostic for distributed training.
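The idea can be illustrated with a minimal sketch. It is not the code from this PR: the helper names `stats_on_device` and `normalize_mean_std`, the `eps` default, and the mean/std stats layout are assumptions made for illustration, and the real processor goes through `_convert_stats_to_tensors`, whose signature is not shown here.

```python
import torch


def stats_on_device(stats: dict[str, torch.Tensor], device: torch.device) -> dict[str, torch.Tensor]:
    """Return a copy of the stats dict with every tensor moved to `device`.

    Hypothetical helper for this sketch; `Tensor.to` returns the same tensor
    when it already lives on the target device, so the move is cheap then.
    """
    return {name: value.to(device) for name, value in stats.items()}


def normalize_mean_std(x: torch.Tensor, stats: dict[str, torch.Tensor], eps: float = 1e-8) -> torch.Tensor:
    """Normalize `x` on whatever device it already lives on.

    The input is never moved; the stats follow the input instead.
    """
    local = stats_on_device(stats, x.device)
    return (x - local["mean"]) / (local["std"] + eps)


# Stats are created on the CPU, as they would be when loaded from a dataset.
stats = {"mean": torch.tensor([1.0, 2.0]), "std": torch.tensor([2.0, 4.0])}

# Use a GPU if one is present, otherwise fall back to the CPU.
device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
batch = torch.tensor([[3.0, 6.0]], device=device)

out = normalize_mean_std(batch, stats)
```

Because the stats are moved per call, the same function works unchanged whether the launcher placed the batch on cuda:0, cuda:1, or the CPU. A real implementation might cache the moved stats per device to avoid repeated transfers.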


* refactor(processor): enhance tensor device handling in normalization process (#1795)
* refactor(tests): remove unsupported device detection test for complementary data (#1797)
* chore(tests): update ToBatchProcessor test (#1798)
* refactor(tests): remove in-place mutation tests for actions and complementary data in batch processor
* test(tests): add tests for action and task processing in batch processor
* add names for android and ios phone (#1799)
* use _tensor_stats in normalize processor (#1800)
* fix(normalize_processor): correct device reference for tensor epsilon handling (#1801)
* add point 5 add missing feature contracts (#1806)
* Fix PR comments 1452 (#1807)
* use key to determine image
* Address rest of PR comments
* use PolicyFeatures in transform_features

---------

Co-authored-by: Pepijn <138571049+pkooij@users.noreply.github.com>

* [Port codebase pipeline] General fixes for RL and scripts (#1748)
* Refactor dataset configuration in documentation and codebase
  - Updated dataset configuration keys from `dataset_root` to `root` and `num_episodes` to `num_episodes_to_record` for consistency.
  - Adjusted replay episode handling by renaming `episode` to `replay_episode`.
  - Enhanced documentation
  - added specific processor to transform from policy actions to delta actions
* Added Robot action to tensor processor Added new processor script for dealing with gym specific action processing
* removed RobotAction2Tensor processor; improved choosing observations in actor
* nit in delta action
* added missing reset functions to kinematics
* Adapt teleoperate and replay to pipeline similar to record
* refactor(processors): move to inheritance (#1750)
* fix(teleoperator): improvements phone implementation (#1752)
* fix(teleoperator): protect shared state in phone implementation
* refactor(teleop): separate classes in phone
* fix: solve breaking changes (#1753)
* refactor(policies): multiple improvements (#1754)
* refactor(processor): simpler logic in device processor (#1755)
* refactor(processor): euclidean distance in delta action processor (#1757)
* refactor(processor): improvements to joint observations processor migration (#1758)
* refactor(processor): improvements to tokenizer migration (#1759)
* refactor(processor): improvements to tokenizer migration
* fix(tests): tokenizer tests regression from #1750
* fix(processors): fix float comparison and config in hil processors (#1760)
* chore(teleop): remove unnecessary callbacks in KeyboardEndEffectorTeleop (#1761)
* refactor(processor): improvements normalize pipeline migration (#1756)
* refactor(processor): several improvements normalize processor step
* refactor(processor): more improvements normalize processor
* refactor(processor): more changes to normalizer
* refactor(processor): take a different approach to DRY
* refactor(processor): final design
* chore(record): revert comment and continue deleted (#1764)
* refactor(examples): pipeline phone examples (#1769)
* refactor(examples): phone teleop + teleop script
* refactor(examples): phone replay + replay
* chore(examples): rename phone example files & folders
* feat(processor): fix improvements to the pipeline porting (#1796)
* refactor(processor): enhance tensor device handling in normalization process (#1795)
* refactor(tests): remove unsupported device detection test for complementary data (#1797)
* chore(tests): update ToBatchProcessor test (#1798)
* refactor(tests): remove in-place mutation tests for actions and complementary data in batch processor
* test(tests): add tests for action and task processing in batch processor
* add names for android and ios phone (#1799)
* use _tensor_stats in normalize processor (#1800)
* fix(normalize_processor): correct device reference for tensor epsilon handling (#1801)
* add point 5 add missing feature contracts (#1806)
* Fix PR comments 1452 (#1807)
* use key to determine image
* Address rest of PR comments
* use PolicyFeatures in transform_features

---------

Co-authored-by: Pepijn <138571049+pkooij@users.noreply.github.com>

---------

Co-authored-by: Michel Aractingi <michel.aractingi@huggingface.co>
Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>
Co-authored-by: Pepijn <138571049+pkooij@users.noreply.github.com>

refactor(processor): enhance tensor device handling in normalization process

553eebe

AdilZouitine merged commit a50cb1c into fix/pipeline_development Aug 28, 2025
1 check passed

AdilZouitine deleted the fix/normalize_processor_step_device branch August 28, 2025 11:07

AdilZouitine mentioned this pull request Aug 28, 2025

feat(processor): fix improvements to the pipeline porting #1796

Merged

imstevenpmwork requested review from imstevenpmwork and removed request for imstevenpmwork August 31, 2025 18:22

AdilZouitine merged 1 commit into fix/pipeline_development from fix/normalize_processor_step_device
