Microsoft’s Project Oxford offers facial, image and speech-recognition APIs

May 3, 2015, 4:23 pm EDT | Justin Lee

Earlier this week, Microsoft quietly released a handful of new machine-learning APIs in beta under its Project Oxford program, while its How-Old.net demo for the service went viral a day after, according to a report by TechCrunch.

How-Old.net lets users upload photos of faces, and the system automatically estimates the age of each person in the photo.

As TechCrunch explains, the website “works reasonably well” with a “fair number of mistakes” and uses some of the new developer services offered under Project Oxford.

The new APIs enable developers to integrate face detection and recognition capabilities into their apps, while the service will attempt to estimate the user’s age and return that information to developers.

Multiple divisions within Microsoft collaborated to develop Oxford and the age-detection project, according to Ryan Galgon, a senior program manager on the Oxford project.

Additionally, the API offers face detection in images, face verification to determine whether two faces belong to the same individual, and the ability to find similar-looking faces.

The API also includes speech recognition capabilities, which will soon be able to help developers better understand their users’ intent. The project also features a vision API for automatically categorizing images and creating smart image crops that always put the subject in the center of the cropped image.

Microsoft will also add a fourth API, offered as a public beta, that enables developers to integrate custom language understanding capabilities into their applications.

The Speech API features speech-recognition services for speech-to-text conversion, a text-to-speech service that converts text into audio, and intent recognition, driven by the project’s Language Understanding Intelligent Service, that tries to understand the speaker’s intent.

The image API enables developers to categorize images in order to filter out adult content, automatically apply tags to images, or organize them into clusters.

The API also offers optical character recognition capabilities, as well as automatic cropping that identifies the important parts of an image and keeps them in the center of the photo as it is cropped.

The service is currently free to use but Microsoft will eventually charge users for access, although the timeframe for this is still unclear.

Microsoft is currently offering a demo version that enables anyone to try out the service.

