FB pixel

Microsoft and Nvidia partner up on speech recognition model training

Microsoft and Nvidia partner up on speech recognition model training
 

Microsoft and Nvidia have announced a new collaboration focusing on the training of artificial intelligence (AI)-powered natural language processing (NLP) models, Venture Beat reports.

Specifically, the companies said they trained the Megatron-Turing Natural Language Generation (MT-NLP) system, which can perform various speech recognition-related tasks, including reading comprehension, common sense reasoning, and natural language inferences.

Building on the firms’ Turing NLG 17B and Megatron-LM models, MT-NLP reportedly contains 530 billion parameters and can achieve ‘unmatched’ accuracy levels.

Nvidia’s senior director of product management and marketing for accelerated computing, Paresh Kharya, and group program manager for the Microsoft Turing team, Ali Alvi recently wrote a blog post on the company’s website, claiming the new technology will shape the future of NLP.

“The journey is long and far from complete, but we are excited by what is possible and what lies ahead,” the technology experts wrote.

From a technological standpoint, MT-NLP was trained using a dataset with 270 billion tokens from English-language websites, most of which came from The Pile’s 850GB collection.

According to the blog post, model training was done using the Nvidia DGX SuperPOD-based Selene supercomputer powered by 560 DGX A100 servers networked with HDR InfiniBand in a full fat tree configuration. For context, each DGX A100 has, in turn, eight Nvidia A100 80GB Tensor Core GPUs, fully connected to each other by NVLink and NVSwitch.

While MT-NLP is arguably the largest and most capable AI-powered language model to date, however, Microsoft and Nvidia confirmed the system “pick[ed] up stereotypes and biases from the data on which it [was] trained.”

Biases in AI and biometric recognition are a known problem in the industry, but the companies said they are currently working to address the problem.

“We encourage continued research to help in quantifying the bias of the model,” they confirmed in the blog post.

Nvidia has been working steadily on NLP and voice biometrics in the last few years. In December 2020, the company announced a partnership with Veritone, and last August, Nvidia spoke with Biometric Update about the potential of voice biometrics for financial services applications.

Article Topics

 |   |   |   |   | 

Latest Biometrics News

 

Cybastion to support digital infrastructure development in DRC

U.S. digital ID and cybersecurity firm Cybastion will deploy its technology and expertise in support of the Democratic Republic of…

 

Tanzania seeks biometrics contractors for Phase II of national digital ID project

Tanzania says it is seeking contractors for some activities related to the execution of Phase II of the country’s national…

 

Smart glasses and the new DHS surveillance budget

The Department of Homeland Security’s (DHS) Fiscal Year (FY) 2027 budget justification lays out an expansive biometric and identity tech…

 

Voice AI expands attack surface for speaker biometrics as APIs proliferate

Deepfake voices are already a challenge for authentication systems. But the task is getting tougher, as big players pursue voice…

 

NetChoice wins in Arkansas, but faces forever war against age assurance

The battle over age assurance legislation in the United States has reached its next level. As the global tide turns…

 

UIDAI selects 20 bug bounty hunters to bolster India’s digital ID security

The Unique Identification Authority of India (UIDAI) has launched a structured bug bounty program. The authority will open its core…

Comments

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Biometric Market Analysis and Buyer's Guides

Most Viewed This Week

Featured Company

Biometrics Insight, Opinion

Digital ID In-Depth

Biometrics White Papers

Biometrics Events