DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction [
arxiv
]
Zhen Yang, Heng Wang, Yanpeng Dong
Beijing Mechanical Equipment Institute, Beijing, China
This is the official implementation of DAOcc. DAOcc is a novel multi-modal occupancy prediction framework that leverages 3D object detection to assist in achieving superior performance while using a deployment-friendly image encoder and practical input image resolution.
- 2025-01-31: Release the model weights and the first version of the code.
- 2024-10-01: Our preprint is available on arXiv.
3D Semantic Occupancy Prediction on Occ3D-nuScenes
Method | Camera Mask |
Image Backbone |
Image Resolution |
mIoU | Config | Model | Log |
---|---|---|---|---|---|---|---|
DAOcc | √ | R50 | 256×704 | 53.82 | config | model | log |
DAOcc* | √ | R50 | 256×704 | 54.19 | - | model | - |
Method | Camera Mask |
Image Backbone |
Image Resolution |
RayIoU | Config | Model | Log |
---|---|---|---|---|---|---|---|
DAOcc | × | R50 | 256×704 | 48.2 | config | model | log |
- * means use exponential moving average (EMA) hook.
3D Semantic Occupancy Prediction on SurroundOcc
Method | Image Backbone |
Image Resolution |
IoU | mIoU | Config | Model | Log |
---|---|---|---|---|---|---|---|
DAOcc | R50 | 256×704 | 45.0 | 30.5 | config | model | log |
3D Semantic Occupancy Prediction on OpenOccupancy
Method | Image Backbone |
Image Resolution |
IoU | mIoU | Config | Model | Log |
---|---|---|---|---|---|---|---|
DAOcc | R18 | 256×704 | 32.2 | 24.1 | - | - | - |
@article{yang2024daocc,
title={DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction},
author={Yang, Zhen and Dong, Yanpeng and Wang, Heng},
journal={arXiv preprint arXiv:2409.19972},
year={2024}
}
Many thanks to these excellent open-source projects: