Releases: NVIDIA/NeMo-Curator
Releases · NVIDIA/NeMo-Curator
NVIDIA NeMo Curator 0.8.0rc3.dev0
Prerelease: NVIDIA NeMo Curator 0.8.0rc3.dev0 (2025-04-15)
NVIDIA NeMo Curator 0.8.0rc2.dev0
Prerelease: NVIDIA NeMo Curator 0.8.0rc2.dev0 (2025-04-07)
NVIDIA NeMo Curator 0.7.1
NVIDIA NeMo Curator 0.7.0
- Python 3.12 Support
- Curator on Blackwell
- Nemotron-CC Dataset Recipe
- Performant S3 for Fuzzy Deduplication
NVIDIA NeMo Curator 0.7.0rc2.dev0
Prerelease: NVIDIA NeMo Curator 0.7.0rc2.dev0 (2025-02-25)
NVIDIA NeMo Curator 0.7.0rc1.dev1
Prerelease: NVIDIA NeMo Curator 0.7.0rc1.dev1 (2025-02-19)
NVIDIA NeMo Curator 0.7.0rc0.dev1
Prerelease: NVIDIA NeMo Curator 0.7.0rc0.dev1 (2025-02-04)
NVIDIA NeMo Curator 0.6.0
What's changed
- Synthetic Data Generation for Text Retrieval
- LLM-based Filters
- Easiness
- Answerability
- Q&A Retrieval Generation Pipeline
- LLM-based Filters
- Parallel Dataset Curation for Machine Translation
- Load/Write Bitext Files
- Heuristic filtering (Histogram, Length Ratio)
- Classifier filtering (Comet, Cometoid)
NVIDIA NeMo Curator 0.6.0rc2.dev1
Prerelease: NVIDIA NeMo Curator 0.6.0rc2.dev1 (2025-01-03)
NVIDIA NeMo Curator 0.6.0rc1.dev1
Prerelease: NVIDIA NeMo Curator 0.6.0rc1.dev1 (2024-12-20)