Issues using Custom Object Detection with ONNX model (output shape 1, 5, 8400)

adrian · May 21, 2025, 4:05pm

Hello,

In the past, I was successfully using YOLO (Ultralytics) for custom object detection with my ZED cameras. I recently switched to using the ZED SDK’s built-in Custom Object Detection pipeline, trying to integrate my YOLO model via the custom_onnx_file option.

However, when using either the .onnx or .engine file exported from YOLO, the object is not detected — nothing appears, even though the model works fine outside the SDK.

After troubleshooting, ChatGPT suggested that the issue might be due to the ONNX output shape. My current model outputs:
(1, 5, N)
But it seems that the ZED SDK expects something like:
(1, N, 5)
Can someone confirm what the expected ONNX output shape is for custom models in the ZED SDK?
How can I export my custom YOLO object detection model with that shape?

Any advice or sample working ONNX output spec would be greatly appreciated.

adujardin · May 21, 2025, 7:02pm

Hi,
For a typical Ultralytics YOLO model, the SDK expects an output shape like [1,84,7581].
The model should be an ONNX file. There are per-model instructions for exporting the correct format here: How to Use Export YOLO ONNX model to use the ZED Custom Object Detection - Stereolabs
YOLOv6 does output a shape like [1,8400,85], and that is also handled (all these examples are for the default COCO model with 80 classes).
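
The shapes quoted above can be sanity-checked with a few lines of Python. This is only a sketch under stated assumptions, not the SDK's own validation logic: it assumes 84 = 4 box coordinates + 80 class scores (Ultralytics-style export), 85 = 4 box coordinates + 1 objectness score + 80 class scores (YOLOv6-style), and detection heads at strides 8, 16 and 32. The helper names `anchor_count` and `describe_output` are hypothetical.

```python
def anchor_count(width, height, strides=(8, 16, 32)):
    """Number of candidate boxes for an anchor-free YOLO head.

    Assumes one prediction per grid cell at each stride and an input size
    that is a multiple of the largest stride.
    """
    return sum((width // s) * (height // s) for s in strides)


def describe_output(shape, num_classes):
    """Guess the layout of a YOLO detection output from its shape.

    Returns "channels_first" for [1, 4 + nc, N] (Ultralytics-style),
    "boxes_first" for [1, N, 5 + nc] (YOLOv6-style, with objectness),
    or "unknown" if neither assumption fits.
    """
    if len(shape) != 3 or shape[0] != 1:
        return "unknown"
    _, second, third = shape
    if second == 4 + num_classes:
        return "channels_first"
    if third == 5 + num_classes:
        return "boxes_first"
    return "unknown"


if __name__ == "__main__":
    print(anchor_count(640, 640))              # 8400
    print(anchor_count(608, 608))              # 7581
    print(describe_output((1, 5, 8400), 1))    # channels_first
    print(describe_output((1, 84, 7581), 80))  # channels_first
    print(describe_output((1, 8400, 85), 80))  # boxes_first
```

Under these assumptions, a single-class model exported at 640x640 with output (1, 5, 8400) has the same layout as the [1,84,7581] example, just with one class instead of 80 and a different input size.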

Could you post a screenshot from a tool like Netron showing the shapes? For instance, here’s a compatible yolov8n with fixed size:

If possible, you could also share your onnx file to troubleshoot (it can be privately sent to support@stereolabs.com)

adrian · May 22, 2025, 10:26am

Hello, I’m using yolo12n detection fine-tuned on a single class, which means it is a binary model. I’m exporting my model with YOLO to an ONNX file, but when I actually use the ONNX file it does not detect anything. I should add that my cameras are subscribed to Fusion, and since the Python wrapper does not implement retrieving detections from Fusion, I’m using the grab method or retrieve_image to receive frames.

adujardin · May 22, 2025, 3:30pm

yolo12 is supported. Make sure your ONNX file works as expected outside of the ZED SDK using third-party code. Something like this looks like it could do the job: GitHub - mohamedsamirx/YOLOv12-ONNX-CPP: YOLOv12 Inference Using CPP and ONNX Runtime
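
For a quick check without the C++ project, the raw output of any ONNX runtime can be inspected directly. The sketch below is dependency-free and assumes the Ultralytics-style layout [1, 4 + nc, N], with the first four rows holding (cx, cy, w, h) and the remaining rows holding per-class scores that are already in the 0 to 1 range. The function name `best_detections` is hypothetical, and no non-maximum suppression is applied, so overlapping boxes are expected.

```python
def best_detections(output, conf_threshold=0.25):
    """List boxes whose best class score reaches the threshold.

    `output` is a nested list shaped [1][4 + nc][N], for example the result
    of calling .tolist() on the array returned by an ONNX runtime.
    Returns (box_index, class_id, score, (cx, cy, w, h)) tuples sorted by
    descending score. No NMS is performed.
    """
    rows = output[0]
    box_rows, class_rows = rows[:4], rows[4:]
    num_boxes = len(rows[0])
    detections = []
    for i in range(num_boxes):
        scores = [row[i] for row in class_rows]
        score = max(scores)
        if score >= conf_threshold:
            class_id = scores.index(score)
            box = tuple(row[i] for row in box_rows)
            detections.append((i, class_id, score, box))
    return sorted(detections, key=lambda d: d[2], reverse=True)


if __name__ == "__main__":
    # Synthetic single-class output with three candidate boxes.
    fake_output = [[
        [10, 20, 30],     # cx
        [11, 21, 31],     # cy
        [2, 4, 6],        # w
        [3, 5, 7],        # h
        [0.1, 0.9, 0.5],  # class 0 score
    ]]
    for det in best_detections(fake_output):
        print(det)
```

If this returns an empty list on an image where the original model detects the object, the problem is more likely in the export or preprocessing than in the ZED SDK.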

You should also try the default COCO-trained model to make sure the process is correct.

adrian · May 22, 2025, 3:55pm

Do you have discord or some other messaging application to talk and share my code and model?

Myzhar · May 22, 2025, 5:18pm

Hi @adrian
please send an email to support@stereolabs.com

adujardin · May 23, 2025, 10:25am

Thanks, we received your files.

I tested both of your models and they work as expected, both using the pth with the Ultralytics framework and using the ONNX with the ZED SDK (specifically with this sample, to make sure: zed-sdk/object detection/custom detector/cpp/tensorrt_yolov5-v6-v8_onnx_internal at master · stereolabs/zed-sdk · GitHub).

Using Python, you can start from this one: zed-sdk/object detection/custom detector/python/yolov5-v6-v8_onnx_internal at master · stereolabs/zed-sdk · GitHub