Merged
6 changes: 3 additions & 3 deletions README.md
@@ -1,6 +1,6 @@
# NeMo Gym

-NeMo Gym is a framework for building reinforcement learning (RL) training environments for large language models (LLMs). It provides infrastructure to develop environments, scale rollout collection, and integrate seamlessly with your preferred training framework.
+NeMo Gym is a library for building reinforcement learning (RL) training environments for large language models (LLMs). It provides infrastructure to develop environments, scale rollout collection, and integrate seamlessly with your preferred training framework.

NeMo Gym is a component of the [NVIDIA NeMo Framework](https://docs.nvidia.com/nemo-framework/), NVIDIA’s GPU-accelerated platform for building and training generative AI models.

@@ -22,7 +22,7 @@ NeMo Gym is a component of the [NVIDIA NeMo Framework](https://docs.nvidia.com/n

NeMo Gym is designed to run on standard development machines:

-- **GPU**: Not required for NeMo Gym framework operation
+- **GPU**: Not required for NeMo Gym library operation
- GPU may be needed for specific resource servers or model inference (see individual server documentation)
- **CPU**: Any modern x86_64 or ARM64 processor (e.g., Intel, AMD, Apple Silicon)
- **RAM**: Minimum 8 GB (16 GB+ recommended for larger environments)
@@ -145,7 +145,7 @@ If you use NeMo Gym in your research, please cite it using the following BibTeX

```bibtex
@misc{nemo-gym,
-title = {NeMo Gym: An Open Source Framework for Scaling Reinforcement Learning Environments for LLM},
+title = {NeMo Gym: An Open Source Library for Scaling Reinforcement Learning Environments for LLM},
howpublished = {\url{https://github.com/NVIDIA-NeMo/Gym}},
author={NVIDIA},
year = {2025},
4 changes: 2 additions & 2 deletions docs/get-started/setup-installation.md
@@ -23,7 +23,7 @@

NeMo Gym is designed to run on standard development machines without specialized hardware:

-- **GPU**: Not required for NeMo Gym framework operation
+- **GPU**: Not required for NeMo Gym library operation
- GPU may be needed for specific resource servers or model inference (see individual server documentation). E.g. if you are intending to train your model with NeMo-RL, GPU resources are required (see training documentation)
- **CPU**: Any modern x86_64 or ARM64 processor (e.g., Intel, AMD, Apple Silicon)
- **RAM**: Minimum 8 GB (16 GB+ recommended for larger environments and datasets)
@@ -327,7 +327,7 @@ Your directory should look like this:
Gym/
├── env.yaml # Your API credentials (git-ignored)
├── .venv/ # Virtual environment (git-ignored)
-├── nemo_gym/ # Core framework code
+├── nemo_gym/ # Core library code
├── resources_servers/ # Tools and environments
├── responses_api_models/ # Model integrations
├── responses_api_agents/ # Agent implementations
2 changes: 1 addition & 1 deletion docs/index.md
@@ -2,7 +2,7 @@

# NeMo Gym Documentation

-[NeMo Gym](https://github.com/NVIDIA-NeMo/Gym) is a framework for building reinforcement learning (RL) training environments for large language models (LLMs). It provides infrastructure to develop environments, scale rollout collection, and integrate seamlessly with your preferred training framework.
+[NeMo Gym](https://github.com/NVIDIA-NeMo/Gym) is a library for building reinforcement learning (RL) training environments for large language models (LLMs). It provides infrastructure to develop environments, scale rollout collection, and integrate seamlessly with your preferred training framework.

A training environment consists of three server components: **Agents** orchestrate the rollout lifecycle—calling models, executing tool calls via resources, and coordinating verification. **Models** provide stateless text generation using LLM inference endpoints. **Resources** define tasks, tool implementations, and verification logic.
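
The division of responsibilities described above can be illustrated with a minimal, purely conceptual sketch. The class names `EchoModel`, `MathResource`, and `SimpleAgent`, along with their methods, are hypothetical and are not NeMo Gym's actual API; in the real system these are separate server components, and tool calls are omitted here for brevity.

```python
from dataclasses import dataclass


@dataclass
class EchoModel:
    """Hypothetical stand-in for a stateless model endpoint."""

    def generate(self, prompt: str) -> str:
        # A real model server would call an LLM inference endpoint here.
        return "4" if "2 + 2" in prompt else "unknown"


@dataclass
class MathResource:
    """Hypothetical resource: holds the task and its verification logic."""

    question: str = "What is 2 + 2?"
    expected: str = "4"

    def verify(self, answer: str) -> float:
        # Reward is 1.0 for an exact match, 0.0 otherwise.
        return 1.0 if answer.strip() == self.expected else 0.0


@dataclass
class SimpleAgent:
    """Hypothetical agent: orchestrates a single rollout."""

    model: EchoModel
    resource: MathResource

    def rollout(self) -> dict:
        # 1. Take the task from the resource, 2. call the model,
        # 3. ask the resource to verify the response.
        prompt = self.resource.question
        response = self.model.generate(prompt)
        reward = self.resource.verify(response)
        return {"prompt": prompt, "response": response, "reward": reward}


result = SimpleAgent(EchoModel(), MathResource()).rollout()
print(result)
```

Keeping the model stateless and placing verification in the resource means the agent is the only piece that knows the order of steps in a rollout, which mirrors the split the paragraph describes.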
