Automation, Vol. 6, Pages 77: Improved Object Detection and Tracking with Camera in Motion Using PrED: Predictive Enhancement of Detection
Automation doi: 10.3390/automation6040077
Authors:
Adibuzzaman Rahi
Hatem Wasfy
Tamer Wasfy
Sohel Anwar
While YOLO’s efficiency and accuracy have made it a popular choice for object detection and tracking in real-world applications, models trained on smaller datasets often suffer from intermittent detection failures, where objects remain undetected across multiple consecutive frames, significantly degrading tracking performance in practical scenarios. To address this challenge, we propose PrED (Predictive Enhancement of Detection), a novel framework that enhances object detection and aids in tracking by integrating low-confidence detections with multiple similarity metrics—including Intersection over Union (IoU), spatial distance similarity, and template similarity, and predicts the locations of undetected objects based on a parameter called predictability index. By maintaining object continuity during missed detections, PrED ensures robust tracking performance even when the underlying detection model experiences failures. Extensive evaluations across multiple benchmark datasets demonstrate PrED’s superior performance, achieving over 11% higher DetA with at least 6.9% MOTA improvement in our test scenarios, 17% higher detection accuracy (DetA) and 12.3% higher Multiple Object Tracking Accuracy (MOTA) on the KITTI training dataset, 8% higher DetA and 2.6% higher MOTA on the MOT17 training dataset, compared to ByteTrack, establishing PrED as an effective solution for enhancing tracking robustness in scenarios with suboptimal detection performance.
Source link
Adibuzzaman Rahi www.mdpi.com
