[feat] reuse existing tests and remove the error config of reward models by ETOgaosion · Pull Request #1 · zzong2006/verl

ETOgaosion · 2025-05-23T09:04:48Z

Checklist Before Starting

Search for similar PR(s).

What does this PR do?

Reuse the existing tests and remove the erroneous reward model config.

High-Level Design

Demonstrate the high-level design if this PR is complex.

Specific Changes

List the specific changes.

API

Demonstrate how the API changes if any.

Usage Example

Provide usage example(s) for easier usage.

# Add code snippet or script demonstrating how to use this

Test

For changes that cannot be tested by CI (e.g., algorithm implementation, new model support), validate by experiment(s) and show results such as training curve plots, evaluation results, etc.

Additional Info.

Issue Number: Fixes issue # or discussion # if any.
Training: [Note which backend this PR will affect: FSDP, Megatron, both, or none]
Inference: [Note which backend this PR will affect: vLLM, SGLang, both, or none]

Checklist Before Submitting

Read the Contribute Guide.
Apply pre-commit checks.
Add [BREAKING] to the PR title if it breaks any API.
Update the documentation about your changes in the docs.
Add CI test(s) if necessary.

reuse existing tests

a501f6a

ETOgaosion mentioned this pull request May 23, 2025

[Megatron] Support optimizer offload for moe when ep > 1 verl-project/verl#1638

Merged

6 tasks

zzong2006 approved these changes May 23, 2025

zzong2006 merged commit 4105358 into zzong2006:support_moe_offload_when_ep_more_than_one May 23, 2025
