Machine Learning Times
Machine Learning Times
EXCLUSIVE HIGHLIGHTS
Video – Credit Models, Microfinance, and Improving the Lives of Families in the Developing World
 Event: Machine Learning Week 2021 Keynote: Credit Models, Microfinance, and...
Video – Identifying Program Effectiveness for Survivors of Human Trafficking from Muneeb Alam of QuantumBlack
 Event: Machine Learning Week 2021 Keynote: Identifying Program Effectiveness for Survivors...
Video – How to Use AI Ethically from Natalia Modjeska of Omdia
 Event: Machine Learning Week 2021 Keynote: How to Use AI...
Video – Alexa On The Edge – A Case Study in Customer-Obsessed Research from Susanj of Amazon
 Event: Machine Learning Week 2021 Keynote: Alexa On The Edge...

language models

Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, the World’s Largest and Most Powerful Generative Language Model

 Originally published in Microsoft Research Blog, Oct 11, 2021 We are excited to introduce the DeepSpeed- and Megatron-powered Megatron-Turing Natural Language Generation model (MT-NLG), the largest and the most powerful monolithic transformer language model trained to date, with 530 billion parameters. It is the result of a research collaboration between Microsoft and NVIDIA to further