[Port codebase pipeline] General fixes for RL and scripts by michel-aractingi · Pull Request #1748 · huggingface/lerobot

michel-aractingi · 2025-08-17T22:07:13Z

RL Pipeline Integration and Processor Improvements

This PR fixes many points related to the Reinforcement learning scripts (gym_manipulator.py) and the associated pipelines.

Completes the pipeline integration by migrating teleoperate.py and replay.py to use configurable processor chains similar to the record.py.

Adds a new GymActionProcessor for RL workflows
improves the DeltaActionProcessor.
includes missing reset functions for robot kinematics.
enhances observation selection in the RL actor based on input_features.

NOTE these changes have been tested heavily in sim and real robot

- Updated dataset configuration keys from `dataset_root` to `root` and `num_episodes` to `num_episodes_to_record` for consistency. - Adjusted replay episode handling by renaming `episode` to `replay_episode`. - Enhanced documentation - added specific processor to transform from policy actions to delta actions

Added new processor script for dealing with gym specific action processing

…in actor

* [Port codebase pipeline] General fixes for RL and scripts (#1748) * Refactor dataset configuration in documentation and codebase - Updated dataset configuration keys from `dataset_root` to `root` and `num_episodes` to `num_episodes_to_record` for consistency. - Adjusted replay episode handling by renaming `episode` to `replay_episode`. - Enhanced documentation - added specific processor to transform from policy actions to delta actions * Added Robot action to tensor processor Added new processor script for dealing with gym specific action processing * removed RobotAction2Tensor processor; imrpoved choosing observations in actor * nit in delta action * added missing reset functions to kinematics * Adapt teleoperate and replay to pipeline similar to record * refactor(processors): move to inheritance (#1750) * fix(teleoperator): improvements phone implementation (#1752) * fix(teleoperator): protect shared state in phone implementation * refactor(teleop): separate classes in phone * fix: solve breaking changes (#1753) * refactor(policies): multiple improvements (#1754) * refactor(processor): simpler logic in device processor (#1755) * refactor(processor): euclidean distance in delta action processor (#1757) * refactor(processor): improvements to joint observations processor migration (#1758) * refactor(processor): improvements to tokenizer migration (#1759) * refactor(processor): improvements to tokenizer migration * fix(tests): tokenizer tests regression from #1750 * fix(processors): fix float comparison and config in hil processors (#1760) * chore(teleop): remove unnecessary callbacks in KeyboardEndEffectorTeleop (#1761) * refactor(processor): improvements normalize pipeline migration (#1756) * refactor(processor): several improvements normalize processor step * refactor(processor): more improvements normalize processor * refactor(processor): more changes to normalizer * refactor(processor): take a different approach to DRY * refactor(processor): final design * chore(record): revert comment and continue deleted (#1764) * refactor(examples): pipeline phone examples (#1769) * refactor(examples): phone teleop + teleop script * refactor(examples): phone replay + replay * chore(examples): rename phone example files & folders * feat(processor): fix improvements to the pipeline porting (#1796) * refactor(processor): enhance tensor device handling in normalization process (#1795) * refactor(tests): remove unsupported device detection test for complementary data (#1797) * chore(tests): update ToBatchProcessor test (#1798) * refactor(tests): remove in-place mutation tests for actions and complementary data in batch processor * test(tests): add tests for action and task processing in batch processor * add names for android and ios phone (#1799) * use _tensor_stats in normalize processor (#1800) * fix(normalize_processor): correct device reference for tensor epsilon handling (#1801) * add point 5 add missing feature contracts (#1806) * Fix PR comments 1452 (#1807) * use key to determine image * Address rest of PR comments * use PolicyFeatures in transform_features --------- Co-authored-by: Pepijn <138571049+pkooij@users.noreply.github.com> --------- Co-authored-by: Michel Aractingi <michel.aractingi@huggingface.co> Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com> Co-authored-by: Pepijn <138571049+pkooij@users.noreply.github.com>

…e#1748) * Refactor dataset configuration in documentation and codebase - Updated dataset configuration keys from `dataset_root` to `root` and `num_episodes` to `num_episodes_to_record` for consistency. - Adjusted replay episode handling by renaming `episode` to `replay_episode`. - Enhanced documentation - added specific processor to transform from policy actions to delta actions * Added Robot action to tensor processor Added new processor script for dealing with gym specific action processing * removed RobotAction2Tensor processor; imrpoved choosing observations in actor * nit in delta action * added missing reset functions to kinematics * Adapt teleoperate and replay to pipeline similar to record

michel-aractingi added 6 commits August 11, 2025 15:39

Added Robot action to tensor processor

53ace28

Added new processor script for dealing with gym specific action processing

removed RobotAction2Tensor processor; imrpoved choosing observations …

e8b8d57

…in actor

nit in delta action

62f716d

added missing reset functions to kinematics

f65e74a

Adapt teleoperate and replay to pipeline similar to record

fe7c368

imstevenpmwork changed the base branch from user/azouitine/2025-7-4-convert-codebase-with-pipeline to feat/pipeline_development August 18, 2025 08:15

imstevenpmwork merged commit 1290a77 into feat/pipeline_development Aug 18, 2025
1 check passed

imstevenpmwork deleted the user/michel-aractingi/2025-8-11-rl_fixes branch August 18, 2025 08:19

imstevenpmwork mentioned this pull request Aug 18, 2025

feat(processor): multiple improvements to the pipeline porting #1749

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Port codebase pipeline] General fixes for RL and scripts#1748

[Port codebase pipeline] General fixes for RL and scripts#1748
imstevenpmwork merged 6 commits intofeat/pipeline_developmentfrom
user/michel-aractingi/2025-8-11-rl_fixes

michel-aractingi commented Aug 17, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

michel-aractingi commented Aug 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

RL Pipeline Integration and Processor Improvements

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

michel-aractingi commented Aug 17, 2025 •

edited

Loading