Back to researchers

Patrick LeGresley

Model-parallel training at scale (Megatron-LM)

Co-authored Megatron-LM: a core reference for scaling transformer training via model parallelism.

Highlights

SystemsTrainingScaling
Focus: Model-parallel training at scale (Megatron-LM)
Why it matters: Co-authored Megatron-LM: a core reference for scaling transformer training via model parallelism.

Research Areas

SystemsTrainingScaling
Patrick LeGresley - AI Researcher Profile | 500AI