Best practice for training LLaMA models in Megatron-LM
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
Annotations of interesting ML papers I read
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
Odysseus: Playground of LLM Sequence Parallelism
A LLaMA1/LLaMA2 Megatron implementation.
Training NVIDIA NeMo Megatron Large Language Model (LLM) using NeMo Framework on Google Kubernetes Engine
Minimal yet high-performance code for pretraining LLMs. Attempts to implement some SOTA features. Supports training via DeepSpeed, Megatron-LM, and FSDP. WIP
Megatron-LM/GPT-NeoX compatible Text Encoder with 🤗Transformers AutoTokenizer (see the tokenizer sketch after this listing).
Run Large Language Models easily.
Wrapped Megatron: As User-Friendly as HuggingFace, As Powerful as Megatron-LM
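The AutoTokenizer-based text encoder entry above is the one item in this listing that names a concrete API. Below is a minimal sketch of that pattern, assuming the `transformers` and `torch` packages are installed and using an illustrative GPT-NeoX tokenizer ID (`EleutherAI/gpt-neox-20b`) that is not taken from the listed repository:

```python
# Minimal sketch: encode text with a 🤗 Transformers AutoTokenizer so the
# resulting token IDs can be consumed by a Megatron-LM / GPT-NeoX style model.
# The model ID below is an assumption for illustration only.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-neox-20b")

# GPT-style tokenizers often ship without a pad token; reuse EOS so batching works.
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token

encoded = tokenizer(
    ["Megatron-LM compatible text encoding.", "Hello world"],
    padding=True,
    return_tensors="pt",
)
print(encoded["input_ids"].shape)  # (batch_size, padded_sequence_length)
```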