Skip to content
Closed
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
7 changes: 6 additions & 1 deletion nemo_rl/models/megatron/community_import.py
Original file line number Diff line number Diff line change
Expand Up @@ -85,9 +85,14 @@ def export_model_from_megatron(
from nemo.tron.converter.qwen import HFQwen2Exporter

exporter_cls = HFQwen2Exporter

elif hf_config.model_type == "qwen3":
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
elif hf_config.model_type == "qwen3":
elif hf_config.model_type in ("qwen3", "qwen3_moe"):

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah, actually I just tested this and it looks like this doesn't work with MoE models at the moment. Could we extend this to support MoE? If too much work, we could always add MoE support in a follow up PR

from nemo.tron.converter.qwen import HFQwen3Exporter

exporter_cls = HFQwen3Exporter
else:
raise ValueError(
f"Unknown model: {hf_model_name}. Currently, only Qwen2 and Llama are supported. "
f"Unknown model: {hf_model_name}. Currently, only Qwen2, Qwen3 and Llama are supported. "
"If you'd like to run with a different model, please raise an issue or consider adding your own converter."
)
print(f"Exporting model {hf_model_name} to {output_path}...")
Expand Down
Loading