YOLOatr : Deep Learning Based Automatic Target Detection and Localization in Thermal Infrared Imagery
- 1. National University of Sciences and Technology, Islamabad, Pakistan
- 2. Mälardalen University, Västerås, Sweden
- 3. University of Galway, Ireland
Description
Automatic Target Detection (ATD) and Recognition (ATR) from Thermal Infrared (TI) imagery in the defense and surveillance domain is a challenging computer vision (CV) task in comparison to the commercial autonomous vehicle perception domain. Limited datasets, peculiar domain-specific and TI modality-specific challenges i.e., limited hardware, scale invariance issues due to greater distances, deliberate occlusion by tactical vehicles, lower sensor resolution and resultant lack of structural information in targets, effects of weather, temperature, and time of day variations and varying target to clutter ratios all result in increased intra-class variability and higher inter-class similarity making accurate real-time ATR a challenging CV task. Resultantly, contemporary state-of-the-art (SOTA) deep learning architecture under-perform in the ATR domain. We propose a modified anchor-based single-stage detector called YOLOatr, based on a modified YOLOv5s, with optimum modifications to detection heads, feature-fusion in the neck, and a custom augmentation profile. We evaluate the performance of our proposed model on a comprehensive DSIAC MWIR dataset for real-time ATR over both correlated and decorrelated testing protocols. The results demonstrate that our proposed model achieves state-of-the-art ATR performance of up to 99.6%.