FB pixel

IBM plans massive data set to improve facial recognition and reduce AI bias

 

IBM is planning to make the largest facial attribute and identity training set in the world, with more than a million images, available this fall to help improve the training of artificial intelligence facial recognition systems and reduce bias in algorithms.

The dataset, which IBM says in a blog post is five times the size of the largest one currently available, will be annotated with attribute and identity information, with images drawn from different countries using Flickr geo-tags, and sample selection bias mitigated with active learning tools. While currently available datasets include attributes, such as hair color, or tags identifying that multiple images are of the same person, the new set from IBM will include both. A dataset with 36,000 facial images evenly distributed among ethnicities, genders, and ages will also be released specifically to help identify and address bias.

“As the adoption of AI increases, the issue of preventing bias from entering into AI systems is rising to the judgement, intuition and expertise. The power of advanced innovations, like AI, lies in their ability to augment, not replace, human decision-making. It is therefore critical that any organization using AI — including visual recognition or video analysis capabilities — train the teams working with it to understand bias, including implicit and unconscious bias, monitor for it, and know how to address it.”

IBM showed earlier this year that the error rate of its Watson Visual Recognition service for facial analysis has been decreased nearly ten-fold, according to the post.

IBM is one of the facial recognition providers whose algorithms were tested by M.I.T. Media Lab Researcher Joy Buolamwini when she found major differences in error rates between people of different populations earlier this year.

Microsoft, another of the leading facial recognition providers with algorithms demonstrating bias in the same test, just announced a dramatic improvement in its facial recognition algorithm’s ability to recognize the gender of people with darker skin tones, as it attempts to deal with the same issue.

Article Topics

 |   |   |   |   | 

Latest Biometrics News

 

Biometrics projects scale to meet great expectations, from borders to payments

Biometrics projects are graduating to production, reaching scale milestones and expanding dramatically in the top stories of the week on…

 

ICE using data and probability to decide where to detain and arrest people

U.S. Immigration and Customs Enforcement’s Enhanced Leads Identification & Targeting for Enforcement (ELITE) tool is being used to identify “targets”…

 

In AI era, identity is about governance, Microblink’s Hartley Thompson tells BU Podcast

“One of the defining things in my life is change,” says Hartley Thompson of Microblink. “How do you react to…

 

CLR Labs wins funding to support biometrics, IAD, digital wallet standardization

Cabinet Louis Reynaud (CLR Labs) has won funding from a French government program to support its standardization efforts in biometrics,…

 

Checkr crossed $800M gross in 2025 as biometric background checks expand

Biometric background check provider Checkr is celebrating 2025 as its most successful year ever, with gross revenue surpassing $800 million…

 

Identity and risk infrastructure startup secures $12M for Europe, LATAM expansion

Monnai, which provides identity and risk data infrastructure, has announced a 12 million dollar equity funding round led by Motive…

Comments

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Biometric Market Analysis and Buyer's Guides

Most Viewed This Week

Featured Company

Biometrics Insight, Opinion

Digital ID In-Depth

Biometrics White Papers

Biometrics Events