FB pixel

Datatang adds 5,000 Traditional Chinese characters to OCR system

Categories Biometrics News  |  Trade Notes
Datatang adds 5,000 Traditional Chinese characters to OCR system
 

Beijing-based artificial intelligence (AI) company Datatang has updated its optical character recognition (OCR) database to include 5,000 handwritten characters in Traditional Chinese.

In a dedicated webpage for the new set, Datatang said the characters were collected by various samples written on A4 paper, square paper, and lined paper, among others.

By adding the characters to its software suite, Datatang enables customers to use OCR of the corresponding traditional Chinese characters when encountering them in the wild. In other words, by scanning a text through a smartphone and the Datatang app, users will now be able to automate data entry and filling out forms.

OCR is sometimes implemented for document scanning in digital identity verification and onboarding applications.

According to the company, the error bound of each vertex of the quadrilateral bounding box around each character is within five pixels, for a qualified annotation. The accuracy of bounding boxes and text transcription accuracy are both reportedly not less than 97 percent.

The addition of the new dataset comes months after Datatang executives said their speech recognition datasets were created with native language speakers and surpassed the industry’s standards. 

More recently, the company showcased its synthetic data generation technology at the 2022 Conference on Computer Vision and Pattern Recognition (CVPR 2022).

Article Topics

 |   |   |   | 

Latest Biometrics News

 

Governments grappling with biometrics to ease airport, public service access

Many of the biometrics providers convening in Abidjan, Cote d’Ivoire for ID4Africa’s 2026 AGM got a first-hand look at how…

 

Biometric Update Podcast: Claire Ma explores the next phase of government digital identity

Governments around the world are moving toward digital identity systems, but not all are taking the same path. On the…

 

Trusted Caller ID with digital wallet and VCs improves call center authentication

Decentralized digital IDs shared from a digital wallet on a smartphone can significantly speed up identity verification by call centers,…

 

EES records 66M border crossings in first six months despite rollout friction

During its first six months of operation of Europe’s biometric-based Entry-Exit System (EES), daily fingerprint checks against EU databases rose…

 

IDDEEA outlines role of e-signatures in Bosnia’s digital transformation

Qualified electronic signatures (QES) have the potential to bring significant improvements to complex, fragmented public administrations like those in Bosnia…

 

Luxembourg opens tender for AI-generated content detection tool

Luxembourg’s Ministry of Digitalization has opened a call for solutions to develop a deepfake detection platform intended to support the…

Comments

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Biometric Market Analysis and Buyer's Guides

Most Viewed This Week

Featured Company

Biometrics Insight, Opinion

Digital ID In-Depth

Biometrics White Papers

Biometrics Events