Best practice for training LLaMA models in Megatron-LM
Annotations of the interesting ML papers I read
Large-scale 4D parallelism pre-training for 🤗 Transformers with Mixture of Experts *(still a work in progress)*
A LLaMA1/LLaMA2 Megatron implementation.
Training an NVIDIA NeMo Megatron Large Language Model (LLM) using the NeMo Framework on Google Kubernetes Engine
Minimal yet high-performance code for pretraining LLMs. Attempts to implement some SOTA features. Implements training through DeepSpeed, Megatron-LM, and FSDP. WIP
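
Not from that repo itself, but as a rough sketch of what the FSDP route looks like in plain PyTorch (the toy model, learning rate, and data are illustrative assumptions; a real pretraining run would build a transformer and a data pipeline here):

```python
import os
import torch
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP

# Launch with: torchrun --nproc_per_node=<gpus> this_script.py
dist.init_process_group("nccl")
local_rank = int(os.environ["LOCAL_RANK"])
torch.cuda.set_device(local_rank)

# Toy stand-in for a transformer; hidden sizes are placeholders.
model = torch.nn.Sequential(
    torch.nn.Linear(1024, 4096),
    torch.nn.GELU(),
    torch.nn.Linear(4096, 1024),
).cuda()

# FSDP shards parameters, gradients, and optimizer state across ranks,
# all-gathering weights on demand during forward/backward.
model = FSDP(model)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)

# One illustrative training step on random data.
x = torch.randn(8, 1024, device="cuda")
loss = model(x).pow(2).mean()
loss.backward()
optimizer.step()
```

DeepSpeed and Megatron-LM cover the same ground with their own engines and launchers; FSDP is the option that stays closest to stock PyTorch.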
Megatron-LM/GPT-NeoX compatible Text Encoder with 🤗Transformers AutoTokenizer.
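
For context on what AutoTokenizer compatibility means in practice, a minimal sketch (the checkpoint name `EleutherAI/gpt-neox-20b` is an illustrative assumption, not necessarily what this project targets):

```python
from transformers import AutoTokenizer

# Illustrative checkpoint; any Hub tokenizer loads through this same API.
tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-neox-20b")

text = "Megatron-LM pretrains large language models."
ids = tokenizer(text)["input_ids"]
print(ids)                    # token IDs fed to the text encoder
print(tokenizer.decode(ids))  # round-trips back to the original text
```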