Robotics

Quantized Object Detection for Real-Time Inference on Embedded GPU Architectures

Publié le - International journal of advanced computer science and applications (IJACSA)

Auteurs : Fatima Zahra Guerrouj, Sergio Alberto Rodriguez Florez, Abdelhafid El Ouardi, Mohamed Abouzahir, Mustapha Ramzi

Deploying deep learning-based object detection models like YOLOv4 on resource-constrained embedded architectures presents several challenges, particularly regarding computing performance, memory usage, and energy consumption. This study examines the quantization of the YOLOv4 model to facilitate real-time inference on lightweight edge devices, focusing on NVIDIA’s Jetson Nano and AGX. We utilize post-training quantization techniques to reduce both model size and computational complexity, all while striving to maintain acceptable detection accuracy. Experimental results indicate that an 8-bit quantized YOLOv4 model can achieve near real-time performance with minimal accuracy loss. This makes it well-suited for embedded applications such as autonomous navigation. Additionally, this research highlights the trade-offs between model compression and detection performance, proposing an optimization method tailored to the hardware constraints of embedded architectures.