FB pixel

Researchers claim improved small object detection with multi-modal 3D network

Researchers claim improved small object detection with multi-modal 3D network
 

A team of academic researchers has developed the Dynamic Point-Pixel Feature Alignment network, known as DPPFA-Net, to address the issue of accurately detecting small objects for robots and autonomous vehicles. When compared to other existing 3D object detection methods, this model demonstrated a significant improvement in average precision, achieving a 7.18 percent increase across various noise conditions.

Auto makers have been adding face biometrics to vehicles like the Genesis GV60 for door unlocking, even as smart automobile developers stake different positions on the kinds of optical sensors they need to meet the computer vision requirements of autonomous vehicles. A consensus on what kinds of cameras are needed for other applications may determine the imaging systems biometrics developers have to work with.

Despite the growing demand for autonomous vehicles and robotic automation solutions, object detection remains a complex task within AI workload. A critical aspect of this challenge involves the use of LiDAR sensors, which generate 3D point clouds that offer depth information about the surrounding environment. However, the LiDAR data is susceptible to noise, potentially causing errors in object detection.

To address this issue, a team led by Professor Hiroyuki Tomiyama from Ritsumeikan University in Japan has introduced a multi-modal 3D object detection approach that combines 3D LiDAR data with 2D RBG images captured by standard cameras. The research emphasizes the significance it holds in the field of robotics, as it enables robots to gain a better understanding of and adapt to their environment.

“Our study could facilitate a better understanding and adaptation of robots to their working environments, allowing a more precise perception of small targets,” explains Tomiyama. “Such advancements will help improve the capabilities of robots in various applications.”

The proposed system comprises several modules: a memory-based point-pixel fusion module (MPPF), a deformable point-pixel fusion module (DPPF), and a semantic alignment evaluator (SAE) module. These various specialized modules are integrated to enhance the accuracy and robustness of object detection in complex scenarios characterized by potential environmental noise.

The memory-based point-pixel fusion module facilitates the interaction among features within the same modality as well as across different modalities. The module utilizes 2D images as a memory bank, enabling the network to learn and adapt to noise in 3D point cloud data. In contrast, the deformable point-pixel fusion module focuses on interactions at specific pixel positions, maintaining high resolution while keeping computational complexity low.

“The DPPF module establishes interactions exclusively with key position pixels based on a sampling strategy. This design not only guarantees a low computational complexity but also enables adaptive fusion functionality, especially beneficial for high-resolution images. The SAE module guarantees semantic alignment of the fused features, thereby enhancing the robustness and reliability of the fusion process,” the researchers explain.

During the evaluation of the Dynamic Point-Pixel Feature Alignment network, the research team introduced artificial multi-modal noise into the KITTI dataset. According to the study’s findings, the proposed network stands out as one of the most advanced and accurate 3D object detection methods available.

Related Posts

Article Topics

 |   |   |   |   |   | 

Latest Biometrics News

 

Mexico tenders $20M contract to support biometric CURP at scale

The Mexican government has published a new tender worth 343 million Mexican pesos (US$19.9 million) to expand the operation of…

 

ITL’s age estimation headway in Germany proves market, regulatory readiness

Supermarket automation like self-checkout lanes and age assurance technologies are both familiar subjects to most people at this point; not…

 

IDBio joins Neurotechnology, Innovatrics with gold in UIDAI biometrics competition

The Unique Identification Authority of India has closed its Biometric SDK Benchmarking Competition, and recognized the winners for the face…

 

Baltic mobile IDs become mainstream alternative to document-based KYC

Clients of businesses using IDenfy technology for Know Your Customer (KYC) in the Baltic states of Estonia and Lithuania now…

 

Secret Service tests mobile FRT app as federal biometric policing expands

The U.S. Secret Service (USSS) has begun field testing a mobile facial recognition application called “Sentry” that could be at…

 

UK ICO plans guidance to build public confidence in AI, biometrics deployments

The UK Information Commissioner’s Office says in a response to government inquiries that it is making progress on the AI…

Comments

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Biometric Market Analysis and Buyer's Guides

Most Viewed This Week

Featured Company

Biometrics Insight, Opinion

Digital ID In-Depth

Biometrics White Papers

Biometrics Events