ml-depth-pro C++ tensorrt ⚡

🛠️ 导出 onnx

opset_version >= 17

import torch
from depth_pro import create_model_and_transforms

model, transform = create_model_and_transforms(
    device=torch.device('cuda:0')
)
model.eval()

x = torch.randn(1, 3, 1536, 1536, device='cuda:0')

with torch.no_grad():
    torch.onnx.export(
        model,
        x,
        "model/depth_pro.onnx",
        export_params=True,
        opset_version=17,
        do_constant_folding=True,
        input_names=['input'],
        output_names=['depth', 'fov'],
        keep_initializers_as_inputs=None)

🛠️ 构建tensorrt引擎

trtexec --onnx=model/depth_pro.onnx --saveEngine=depth_pro.engine --fp16 --verbose

⚡ 推理

make runpro

Compile CXX src/main.cpp
Link workspace/pro
------------------------------------------------------
TensorRT-Engine 🌱 is Static Shape model
Inputs: 1
	0.input : {1 x 3 x 1536 x 1536} [float32]
Outputs: 2
	0.depth : {1 x 1 x 1536 x 1536} [float32]
	1.fov : {1 x 1 x 1 x 1} [float32]
------------------------------------------------------
[⏰ timer]: 	19896.27734 ms

🏃 速度

在3090上循环跑100次耗时19896ms

nv::EventTimer t;
t.start();
for (int i = 0; i < 100; i++)
{
    int height = image.rows;
    auto depth_map = depth_pro->forward(TensorRT::cvimg(image));
    cv::Mat depth_mat(depth_map.depth_map);

    depth_mat = depth_mat.reshape(1, height);
}
t.stop();
// [⏰ timer]: 	19896.27734 ms

✉️ pybind11 封装

编译
```
make all
```

运行

>>> import cv2
>>> import trtdepthpro
>>> engine_path = "depth_pro.engine"
>>> image_path  = "images/bus.jpg"
>>> image = cv2.imread(image_path)
>>> model = trtdepthpro.TrtDepthProInfer(engine_path, 0)
------------------------------------------------------
TensorRT-Engine 🌱 is Static Shape model
Inputs: 1
    0.input : {1 x 3 x 1536 x 1536} [float32]
Outputs: 2
    0.depth : {1 x 1 x 1536 x 1536} [float32]
    1.fov : {1 x 1 x 1 x 1} [float32]
------------------------------------------------------
>>> res = model.forward(image)
>>> res
DepthMap(width: 810, height: 1080, fov_deg: 37.1668)

🧪 不同分辨率模型测试

测试结果为pytorch识别
使用不同分辨率模型脚本: workspace/convert_to_coreml.py
来源：https://github.com/harism/ml-depth-pro 备注：转为onnx后可以使用onnxruntime推理，转为engine后推理结果是错误的。

768 x 768 识别结果

1152 x 1152 识别结果

🤖 测试环境

系统
Ubuntu 22.04 LTS
docker镜像
nvcr.io/nvidia/tensorrt:24.10-py3
gpu
3090

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
src		src
workspace		workspace
.gitignore		.gitignore
1152x1152.jpg		1152x1152.jpg
768x768.jpg		768x768.jpg
Makefile		Makefile
README.md		README.md
example.png		example.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ml-depth-pro C++ tensorrt ⚡

🛠️ 导出 onnx

🛠️ 构建tensorrt引擎

⚡ 推理

🏃 速度

✉️ pybind11 封装

🧪 不同分辨率模型测试

768 x 768 识别结果

1152 x 1152 识别结果

🤖 测试环境

👏 参考

About

Releases

Packages

Languages

leon0514/ml-depth-pro-trt10

Folders and files

Latest commit

History

Repository files navigation

ml-depth-pro C++ tensorrt ⚡

🛠️ 导出 onnx

🛠️ 构建tensorrt引擎

⚡ 推理

🏃 速度

✉️ pybind11 封装

🧪 不同分辨率模型测试

768 x 768 识别结果

1152 x 1152 识别结果

🤖 测试环境

👏 参考

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages