FB pixel

PXL Vision integrates deepfake detection technique from research with Idiap

Idiap research paper tests Vision Language Models against text manipulation
PXL Vision integrates deepfake detection technique from research with Idiap
 

A research collaboration between PXL Vision, the Idiap Research Institute and Swiss innovation agency Innosuisse has led to the identity verification provider launching a new deepfake detector to spot fake or manipulated faces in identity documents.

The detection work was conducted as part of the project ROSALIND (106.729 IP-ICT), funded by Innosuisse in collaboration with PXL Vision. From PXL’s perspective, the collaboration has resulted in advanced deepfake detection capabilities designed specifically for automated IDV, which have already been fully integrated into the PXL Ident platform.

The new capabilities enable face swapping and face reenactment detection, as well as detection of fully synthetic identities created using generative AI. In a LinkedIn post, the firm says it aims to “increase resilience against AI-driven identity attacks and provide greater confidence in compliance-critical onboarding processes, without creating additional friction for users.”

In a press release, Idiap’s Sébastien Marcel says “this achievement reflects Idiap’s long-standing expertise in biometrics, AI security and trusted digital identity. Our collaboration with PXL Vision has been both productive and inspiring.”

“While our experience with identity documents was limited, we were able to build on PXL’s strong expertise in this area. The result achieved has exceeded our initial expectation, and we look forward to continuing to push the boundaries of robust, privacy-compliant identity verification in the future.”

Last year Idiap and PXL Vision hosted a competition that challenged participants to detect injection attacks in identity documents.

Detecting deepfaked text on ID documents

Another outcome from the research is a new paper from the Biometrics Security & Privacy group at Idiap looking at “Detecting Text Manipulation in Images using Vision Language Models.”

“Recent works have shown the effectiveness of Large Vision Language Models (VLMs or LVLMs) in image manipulation detection,” says the abstract. “However, text manipulation detection is largely missing in these studies.” The research aims to close the knowledge gap by analyzing closed and open-source VLMs on different text manipulation datasets.

Rapid progress in the quality of AI-assisted image generation means manipulated images have become more difficult to visually identify, says the paper, “especially when small but semantically critical regions, such as text, are modified. Detecting such subtle changes is challenging and current image forgery detection methods often overlook the manipulated text regions.”

The authors of the paper, Vidit Vidit, Pavel Korshunov, Amir Mohammadi, Christophe Ecabert, Ketan Kotwal and Sébastien Marcel, find that “open-source models are getting closer, but still behind closed-source ones like GPT4o.”

The research benchmarks image manipulation detection algorithms against OSTF and FantasyID datasets, and concludes that “GPT-4o is much better than open source models. Qwen-2.5 is the best performing open source model. Specialised VLMs like FakeShield and SIDA fail to generalize to the new task of text manipulation detection.” This latter issue is labeled “the generalization problem” and defined as follows: “In their training and evaluation datasets,” VLMs “rarely see text manipulation, making them biased towards artifacts of the scene manipulations.”

In other words, detectors are paying attention to the pictures, but ignoring the words.

Related Posts

Article Topics

 |   |   |   |   |   |   | 

Latest Biometrics News

 

As identity infrastructure scales, governance becomes the differentiator

Biometrics bound to credentials increasingly underpin the trust infrastructure of digital life, yet as digital systems reach deployment, they are…

 

Imprivata CEO tells Biometric Update Podcast why identity must evolve faster

A lot of people will tell you how fast the tech industry moves. Fran Rosch, the CEO of Imprivata, has…

 

Passenger growth, AI fraud push digital travel credentials toward tipping point

Digital travel credentials (DTCs) are at a crucial moment in their adoption as the travel industry undergoes profound structural changes,…

 

Thales makes strong debut in NIST’s FRIF fingerprint biometrics benchmark

New entries to NIST’s benchmark for large-scale fingerprint biometric capture and comparison software from Thales and Innovatrics show significant gains…

 

CCIA entreats US Supreme Court to intervene in Texas app store age check law

In the present historical moment, it is borderline comical to see advocacy groups for the technology industry insist that age…

 

The US counter-cartel fight is becoming an identity intelligence war

The creation of the Joint Interagency Task Force-Counter Cartel (JIATF-CC) under the U.S. Northern Command (NORTHCOM) marks more than another…

Comments

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Biometric Market Analysis and Buyer's Guides

Most Viewed This Week

Featured Company

Biometrics Insight, Opinion

Digital ID In-Depth

Biometrics White Papers

Biometrics Events