- NVIDIA
- San Jose
- https://erhoo82.github.io/about/
Popular repositories
- apex (Public, forked from NVIDIA/apex)
  A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
  Python 1
- NeMo-Megatron-Launcher (Public, forked from NVIDIA/NeMo-Framework-Launcher)
  NeMo Megatron launcher and tools
  Python 1
- TransformerEngine (Public, forked from NVIDIA/TransformerEngine)
  A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, to provide better performance with lower memory utilization in bot…
  Python 1
- Megatron-LM (Public, forked from NVIDIA/Megatron-LM)
  Ongoing research training transformer models at scale
  Python 1
219 contributions in the last year
Contribution activity
February 2025
Created 5 commits in 1 repository
Created a pull request in NVIDIA/NeMo that received 2 comments
interface for asymmetric pipeline schedule
What does this PR do?
Add the interface for the asymmetric pipeline schedule
Changelog
Introduce account_for_embedding_in_pipeline_split and acco…
Opened 3 other pull requests in 2 repositories
NVIDIA/NeMo: 2 merged
- Update README.md (Feb 20)
- Fix num nodes to match parallel mappings (Feb 13)
NVIDIA/TransformerEngine: 1 open
- Support vectorized local reduction for p2p-based ReduceScatter overlap (Feb 4)
Reviewed 7 pull requests in 1 repository
NVIDIA/NeMo: 7 pull requests
- Add docs on env vars (Feb 24)
- Perf script fix (Feb 20)
- Malay/bw scripts (Feb 10)
- Add performance-optimized example for llama2 70b LoRA (Feb 7)
- numactl cmd (Feb 6)
- Recipe changes for performance (Feb 5)
- [MoE] fix run err in mixtral22B recipe and update its perf config (Feb 3)