3 January 2025 LIS-DETR: small target detection transformer for autonomous driving based on learned inverted residual cascaded group
Yifei Chen, Ye Jin, Yang Wei, Weixin Hu, Zhihui Zhang, Chongzhou Wang, Xuechen Jiao
Author Affiliations +
Abstract

We propose a learned inverted residual cascaded group small target detection transformer (LIS-DETR) model, a novel approach for small object detection in autonomous driving. This model has a unique backbone network, the basic inverted residual cascaded group mobile block, which enhances feature representation and reduces computational redundancy. A dedicated small detection layer is integrated to improve small object detection specifically. In addition, an adaptive learned positional encoding transformer layer is incorporated to strengthen global contextual relationships, and the designed inner-SIoU loss function further accelerates convergence speed. Experimental results show a 3.1% increase in mAP50 accuracy on VisDrone datasets and a 1.9% improvement on processed SODA10m datasets compared with baseline methods. These advances demonstrate the LIS-DETR model’s strong generalization ability and the significant potential to enhance the efficacy of autonomous driving systems.

© 2025 SPIE and IS&T

Funding Statement

Yifei Chen, Ye Jin, Yang Wei, Weixin Hu, Zhihui Zhang, Chongzhou Wang, and Xuechen Jiao "LIS-DETR: small target detection transformer for autonomous driving based on learned inverted residual cascaded group," Journal of Electronic Imaging 34(1), 013003 (3 January 2025). https://doi.org/10.1117/1.JEI.34.1.013003
Received: 14 September 2024; Accepted: 18 December 2024; Published: 3 January 2025
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Object detection

Autonomous driving

Transformers

Data modeling

Target detection

Small targets

Education and training

Back to Top