Getting depth image from stereo images

Hi all,

I recorded my dataset with stereo images from a ZED 2i camera (not a wise idea, but unfortunately the data recording has already finished). Is there any way that I can use the ZED SDK to recreate the depth images? I could use OpenCV for this, but I assume the depth quality from the SDK would be better.

Thanks in advance

The ZED 2i is a stereo camera, which means it uses both lenses to estimate depth by comparing the left and right images, much like human vision. The ZED SDK computes depth by finding disparities between corresponding points in the left and right images after stereo rectification.

  • Two images (both lenses): If your recording includes both the left and right images for each frame, you can absolutely use the ZED SDK to recreate the depth map, and the SDK is better optimised than OpenCV.
  • Only single images (one lens): No, you cannot use the ZED SDK for stereo depth; you'd need to use monocular AI models, which are less accurate.
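
To make the disparity-to-depth relationship concrete, here is a minimal sketch of the triangulation formula that stereo matching relies on. The focal length, baseline, and disparity values are illustrative placeholders, not numbers taken from this thread; read the real values from your camera's factory calibration.

```python
# Illustrative only: depth from stereo disparity for rectified images.
# depth = focal_length_px * baseline_m / disparity_px
def depth_from_disparity(disparity_px: float, fx_px: float, baseline_m: float) -> float:
    """Depth in metres of a point whose left/right pixel offset is disparity_px."""
    return fx_px * baseline_m / disparity_px

# Placeholder values: fx ~ 700 px, baseline ~ 0.12 m (roughly the ZED 2i's 12 cm)
print(depth_from_disparity(disparity_px=40.0, fx_px=700.0, baseline_m=0.12))  # ~2.1 m
```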

Hi Adrian, thank you so much for replying. I have the rectified left and right images recorded. I initially thought this would be doable with the ZED SDK as well; however, after looking at the tutorials, I didn't see how I can do this. Could you please point me in the right direction? Any information is appreciated!

Hi,
The ZED SDK can't take rectified images (like PNG files) as input. It requires SVO files recorded with the ZED SDK.
SVO files are proprietary and optimized for the ZED SDK. They mainly contain raw side-by-side images, just like the camera's USB/GMSL stream, and they can't be recreated from exported images.
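
For reference, SVO files are produced at recording time by the SDK itself. Below is a rough, hedged sketch of that flow with the pyzed Python wrapper, for any future recordings; class and enum names are quoted from memory and may differ slightly between SDK versions.

```python
# Hedged sketch: recording an SVO with the ZED SDK's Python wrapper (pyzed).
import pyzed.sl as sl

zed = sl.Camera()
if zed.open(sl.InitParameters()) != sl.ERROR_CODE.SUCCESS:
    raise RuntimeError("Could not open the ZED camera")

# Ask the SDK to write the raw side-by-side stream into an SVO container.
zed.enable_recording(sl.RecordingParameters("output.svo", sl.SVO_COMPRESSION_MODE.H264))

runtime = sl.RuntimeParameters()
for _ in range(300):       # record ~300 frames
    zed.grab(runtime)      # every successful grab is appended to the SVO

zed.disable_recording()
zed.close()
```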

Hi,
Thanks for your reply. If the ZED SDK is not an option, would you recommend any methods or packages to recreate the depth images? Thanks in advance!

You can use OpenCV, or more modern AI methods like FoundationStereo, IGEV, RAFT, etc.
These will generate the disparity map; then you'll need to use your factory camera calibration parameters to get the depth and point cloud data.
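
For example, a minimal OpenCV sketch might look like the one below. The file names, SGBM matcher settings, and the fx/baseline numbers are placeholders; replace them with your own rectified frames and the ZED 2i factory calibration values.

```python
# Minimal sketch: disparity from a rectified stereo pair, then depth via calibration.
import cv2
import numpy as np

left = cv2.imread("left.png", cv2.IMREAD_GRAYSCALE)    # rectified left image
right = cv2.imread("right.png", cv2.IMREAD_GRAYSCALE)  # rectified right image

# Semi-global block matching; numDisparities must be a multiple of 16, blockSize odd.
matcher = cv2.StereoSGBM_create(
    minDisparity=0,
    numDisparities=128,
    blockSize=5,
    P1=8 * 1 * 5 * 5,
    P2=32 * 1 * 5 * 5,
    uniquenessRatio=10,
    speckleWindowSize=100,
    speckleRange=2,
)

# OpenCV returns fixed-point disparities scaled by 16.
disparity = matcher.compute(left, right).astype(np.float32) / 16.0

# Placeholder calibration: focal length in pixels and baseline in metres
# (take the real values from the ZED factory calibration).
fx = 700.0
baseline = 0.12

# depth = fx * baseline / disparity, ignoring invalid (non-positive) disparities.
valid = disparity > 0
depth = np.zeros_like(disparity)
depth[valid] = fx * baseline / disparity[valid]
```

If you also have the rectification Q matrix from calibration, cv2.reprojectImageTo3D can turn the disparity map into a point cloud directly.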

I just tried FoundationStereo and it worked quite well (honestly, the depth images seem even better than with the ZED SDK). Thank you so much for your help!

FoundationStereo is indeed very impressive. However, I must highlight that it is a foundation model and is not designed to be deployed in a production environment. It’s also several orders of magnitude slower than our SDK on Jetson.