FB pixel

Microsoft’s Project Oxford offers facial, image and speech-recognition APIs

 

Earlier this week, Microsoft quietly released a handful of new machine-learning APIs in beta under its Project Oxford program, while its How-Old.net demo for the service went viral a day after, according to a report by TechCrunch.

How-Old.net demonstrates how developers can upload photos of faces and the system automatically deciphers the age of the person in the photo.

As TechCrunch explains, the website “works reasonably well” with a “fair number of mistakes” and uses some of the new developer services offered under Project Oxford.

The new APIs enable developers to integrate face detection and recognition capabilities into their apps, while the service will attempt to calculate the user’s age and send the information to developers.

Multiple divisions within Microsoft collaborated to develop Oxford and the age-detection project, according to Ryan Galgon, a senior program manager on the Oxford project.

Additionally, the API offers face detection capabilities in images, face verification to determine whether two faces are, in fact, the same individual, and the ability to find similar-looking faces.

The API also includes speech recognition capabilities, which will soon be able to help developers to better under their user’s intent. The project also features a vision API for automatically categorizing images and creating smart image crops that always put the subject into the center of the cropped images.

Offer as a public beta, Microsoft will also add a fourth API that enables developers to integrate custom language understanding capabilities into their applications.

The Speech API feature speech-recognition services for speech-to-text conversion, a text-to-speech service that converts text into audio, and intent recognition that tries to understand the speaker’s intent, which is driven by the project’s Language Understanding Intelligent Service.

The image API enables developers to categorize images for the purpose of filtering out adult content or automatically applying tags to images or organizing them into clusters.

The API also offers optical character recognition capabilities, enabling developers to crop images automatically by determining the important aspects in an image and maintaining those components in the center of the photo as you crop it.

The service is currently free to use but Microsoft will eventually charge users for access, although the timeframe for this is still unclear.

Microsoft is currently offering a demo version that enables anyone to try out the service.

Article Topics

 |   |   |   |   | 

Latest Biometrics News

 

Biometric Update Podcast: Claire Ma explores the next phase of government digital identity

Governments around the world are moving toward digital identity systems, but not all are taking the same path. On the…

 

Trusted Caller ID with digital wallet and VCs improves call center authentication

Decentralized digital IDs shared from a digital wallet on a smartphone can significantly speed up identity verification by call centers,…

 

EES records 66M border crossings in first six months despite rollout friction

During its first six months of operation of Europe’s biometric-based Entry-Exit System (EES), daily fingerprint checks against EU databases rose…

 

IDDEEA outlines role of e-signatures in Bosnia’s digital transformation

Qualified electronic signatures (QES) have the potential to bring significant improvements to complex, fragmented public administrations like those in Bosnia…

 

Luxembourg opens tender for AI-generated content detection tool

Luxembourg’s Ministry of Digitalization has opened a call for solutions to develop a deepfake detection platform intended to support the…

 

Dutch court backs DigiD contract renewal amid U.S. CLOUD Act fears

A Dutch court has ruled that the government may extend its contract with Solvinity, a key infrastructure provider for the…

Comments

21 Replies to “Microsoft’s Project Oxford offers facial, image and speech-recognition APIs”

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Biometric Market Analysis and Buyer's Guides

Most Viewed This Week

Featured Company

Biometrics Insight, Opinion

Digital ID In-Depth

Biometrics White Papers

Biometrics Events