
Microsoft and Nvidia partner up on speech recognition model training


Microsoft and Nvidia have announced a new collaboration focused on training artificial intelligence (AI)-powered natural language processing (NLP) models, VentureBeat reports.

Specifically, the companies said they trained the Megatron-Turing Natural Language Generation (MT-NLG) model, which can perform a range of natural language tasks, including reading comprehension, commonsense reasoning, and natural language inference.

Building on the firms’ Turing NLG 17B and Megatron-LM models, MT-NLG reportedly contains 530 billion parameters and can achieve ‘unmatched’ accuracy levels.

Paresh Kharya, Nvidia’s senior director of product management and marketing for accelerated computing, and Ali Alvi, group program manager for the Microsoft Turing team, recently published a blog post claiming the new technology will shape the future of NLP.

“The journey is long and far from complete, but we are excited by what is possible and what lies ahead,” the technology experts wrote.

On the technical side, MT-NLG was trained using a dataset with 270 billion tokens from English-language websites, most of which came from The Pile’s 850GB collection.

According to the blog post, model training was done on the Nvidia DGX SuperPOD-based Selene supercomputer, powered by 560 DGX A100 servers networked with HDR InfiniBand in a full fat tree configuration. For context, each DGX A100 contains eight Nvidia A100 80GB Tensor Core GPUs, fully connected to each other by NVLink and NVSwitch.
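To put those hardware figures in perspective, the short Python sketch below multiplies them out. It uses only the numbers quoted above (560 servers, eight GPUs per server, 80GB per GPU); the function and variable names are illustrative, and the totals are simple arithmetic on the reported configuration rather than figures stated by either company.

```python
# Figures quoted in the article; names below are illustrative only.
DGX_SERVERS = 560        # DGX A100 servers reported for the training run
GPUS_PER_SERVER = 8      # A100 GPUs in each DGX A100
GPU_MEMORY_GB = 80       # memory per A100 80GB GPU


def cluster_totals(servers: int, gpus_per_server: int, gpu_memory_gb: int) -> tuple[int, int]:
    """Return (total GPU count, total GPU memory in GB) for the cluster."""
    total_gpus = servers * gpus_per_server
    total_memory_gb = total_gpus * gpu_memory_gb
    return total_gpus, total_memory_gb


total_gpus, total_memory_gb = cluster_totals(DGX_SERVERS, GPUS_PER_SERVER, GPU_MEMORY_GB)
print(f"{total_gpus} GPUs, {total_memory_gb} GB of aggregate GPU memory")
```

On the reported configuration this works out to 4,480 GPUs and 358,400GB of combined GPU memory.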

MT-NLG is arguably the largest and most capable AI-powered language model to date. However, Microsoft and Nvidia acknowledged that the system “pick[ed] up stereotypes and biases from the data on which it [was] trained.”

Biases in AI and biometric recognition are a known problem in the industry, but the companies said they are currently working to address it.

“We encourage continued research to help in quantifying the bias of the model,” they wrote in the blog post.

Nvidia has been working steadily on NLP and voice biometrics in the last few years. In December 2020, the company announced a partnership with Veritone, and last August, Nvidia spoke with Biometric Update about the potential of voice biometrics for financial services applications.


