FB pixel

Google patents method of matching voices to speakers’ faces in video

Categories Biometric R&D  |  Biometrics News
 

A patent filed by Google for an automated method of matching faces to voices in videos has been published by the World Intellectual Property Organization.

The patent, which was originally filed in April of last year, describes a computer-implemented method for speech diarization, in which a convolutional neural network is used to recognize faces, and a machine learning model is applied to segments of speech to detect different speakers. Wikipedia describes speaker diarization as a process of partitioning an audio input stream into homogenous segments according to speaker identity.

“The content system detects speech sounds in the audio track of the video, and clusters these speech sounds by individual distinct voice,” inventors Sourish Chaudhuri and Kenneth Hoover write in the application. “The content system further identifies faces in the video, and clusters these faces by individual distinct faces. The content system correlates the identified voices and faces to match each voice to each face. By correlating voices with faces, the content system is able to provide captions that accurately represent on-screen and off-screen speakers.”

Google researchers also published a paper earlier this year detailing an audio-visual method for using AI to separate speech from different individuals, mimicking the “cocktail party effect.”

Article Topics

 |   |   |   | 

Latest Biometrics News

 

Hawaii ID issue shows interoperability matters as digital IDs scale

By Albert Roux, EVP Product for Microblink Travelers at Hawaii airports recently experienced delays because valid state-issued IDs could not…

 

State Department moves to buy Clearview AI licenses for Colombia police

The U.S. State Department’s Bureau of International Narcotics and Law Enforcement (INL) at the U.S. Embassy in Bogotá, Colombia is…

 

Meta licensed ROC facial recognition, liveness for smart glasses project

Meta’s development of facial recognition for its smart glasses is drawing sharper scrutiny after reporting that the company licensed technology…

 

UK aims to lead the world with new age restrictions for social media, AI chatbots

After months of promises, the UK government has pulled the trigger on regulations to restrict social media sites for children…

 

Germany moves to allow police facial recognition searches of online images

Europe’s largest internet industry association, eco, has warned against Germany’s plan to allow its law enforcement agencies to run automated…

 

US senators propose curbs on AI-generated election deception

A group of Senate Democrats Thursday renewed a push to regulate the use of AI in federal elections, targeting both…

Comments

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Biometric Market Analysis and Buyer's Guides

Most Viewed This Week

Featured Company

Biometrics Insight, Opinion

Digital ID In-Depth

Biometrics White Papers

Biometrics Events