M

Megatron-LM

Ongoing research training transformer language models at scale, including: BERT & GPT-2