RTMDet: Optimized for Qualcomm Devices

RTMDet is a highly efficient model for real-time object detection,capable of predicting both the bounding boxes and classes of objects within an image.It is highly optimized for real-time applications, making it reliable for industrial and commercial use

This is based on the implementation of RTMDet found here. This repository contains pre-exported model files optimized for Qualcomm® devices. You can use the Qualcomm® AI Hub Models library to export with custom configurations. More details on model performance across various devices, can be found here.

Qualcomm AI Hub Models uses Qualcomm AI Hub Workbench to compile, profile, and evaluate this model. Sign up to run these models on a hosted Qualcomm® device.

Getting Started

Due to licensing restrictions, we cannot distribute pre-exported model assets for this model. Use the Qualcomm® AI Hub Models Python library to compile and export the model with your own:

  • Custom weights (e.g., fine-tuned checkpoints)
  • Custom input shapes
  • Target device and runtime configurations

See our repository for RTMDet on GitHub for usage instructions.

Model Details

Model Type: Model_use_case.object_detection

Model Stats:

  • Model checkpoint: RTMDet Medium
  • Input resolution: 640x640
  • Number of parameters: 27.5M
  • Model size (float): 105 MB

Performance Summary

Model Runtime Precision Chipset Inference Time (ms) Peak Memory Range (MB) Primary Compute Unit
RTMDet ONNX float Snapdragon® X2 Elite 8.171 ms 53 - 53 MB NPU
RTMDet ONNX float Snapdragon® X Elite 14.218 ms 51 - 51 MB NPU
RTMDet ONNX float Snapdragon® 8 Gen 3 Mobile 10.625 ms 5 - 235 MB NPU
RTMDet ONNX float Qualcomm® QCS8550 (Proxy) 13.593 ms 0 - 55 MB NPU
RTMDet ONNX float Qualcomm® QCS9075 23.627 ms 5 - 12 MB NPU
RTMDet ONNX float Snapdragon® 8 Elite For Galaxy Mobile 8.307 ms 3 - 184 MB NPU
RTMDet ONNX float Snapdragon® 8 Elite Gen 5 Mobile 5.973 ms 5 - 189 MB NPU
RTMDet ONNX w8a16_mixed_fp16 Snapdragon® X2 Elite 11.226 ms 32 - 32 MB NPU
RTMDet ONNX w8a16_mixed_fp16 Snapdragon® X Elite 29.616 ms 29 - 29 MB NPU
RTMDet ONNX w8a16_mixed_fp16 Snapdragon® 8 Gen 3 Mobile 22.087 ms 3 - 383 MB NPU
RTMDet ONNX w8a16_mixed_fp16 Qualcomm® QCS8550 (Proxy) 28.159 ms 0 - 36 MB NPU
RTMDet ONNX w8a16_mixed_fp16 Qualcomm® QCS9075 32.76 ms 2 - 5 MB NPU
RTMDet ONNX w8a16_mixed_fp16 Snapdragon® 8 Elite For Galaxy Mobile 14.374 ms 1 - 311 MB NPU
RTMDet ONNX w8a16_mixed_fp16 Snapdragon® 8 Elite Gen 5 Mobile 10.413 ms 3 - 328 MB NPU
RTMDet TFLITE float Snapdragon® 8 Gen 3 Mobile 11.722 ms 0 - 280 MB NPU
RTMDet TFLITE float Qualcomm® QCS8275 (Proxy) 84.037 ms 0 - 207 MB NPU
RTMDet TFLITE float Qualcomm® QCS8550 (Proxy) 16.034 ms 0 - 3 MB NPU
RTMDet TFLITE float Qualcomm® SA8775P 23.026 ms 0 - 209 MB NPU
RTMDet TFLITE float Qualcomm® QCS9075 25.322 ms 0 - 62 MB NPU
RTMDet TFLITE float Qualcomm® QCS8450 (Proxy) 38.082 ms 0 - 347 MB NPU
RTMDet TFLITE float Qualcomm® SA7255P 84.037 ms 0 - 207 MB NPU
RTMDet TFLITE float Qualcomm® SA8295P 29.911 ms 0 - 268 MB NPU
RTMDet TFLITE float Snapdragon® 8 Elite For Galaxy Mobile 9.169 ms 0 - 209 MB NPU
RTMDet TFLITE float Snapdragon® 8 Elite Gen 5 Mobile 6.836 ms 0 - 210 MB NPU

License

  • The license for the original implementation of RTMDet can be found here.

References

Community

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support