Skip to content

Latest commit

 

History

History
68 lines (47 loc) · 4.56 KB

README.md

File metadata and controls

68 lines (47 loc) · 4.56 KB

DAOcc

PWC

DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction [arxiv]
Zhen Yang, Heng Wang, Yanpeng Dong
Beijing Mechanical Equipment Institute, Beijing, China

This is the official implementation of DAOcc. DAOcc is a novel multi-modal occupancy prediction framework that leverages 3D object detection to assist in achieving superior performance while using a deployment-friendly image encoder and practical input image resolution.

News

  • 2025-01-31: Release the model weights and the first version of the code.
  • 2024-10-01: Our preprint is available on arXiv.

Experimental results

3D Semantic Occupancy Prediction on Occ3D-nuScenes

Method Camera
Mask
Image
Backbone
Image
Resolution
mIoU Config Model Log
DAOcc R50 256×704 53.82 config model log
DAOcc* R50 256×704 54.19 - model -
Method Camera
Mask
Image
Backbone
Image
Resolution
RayIoU Config Model Log
DAOcc × R50 256×704 48.2 config model log
  • * means use exponential moving average (EMA) hook.

3D Semantic Occupancy Prediction on SurroundOcc

Method Image
Backbone
Image
Resolution
IoU mIoU Config Model Log
DAOcc R50 256×704 45.0 30.5 config model log

3D Semantic Occupancy Prediction on OpenOccupancy

Method Image
Backbone
Image
Resolution
IoU mIoU Config Model Log
DAOcc R18 256×704 32.2 24.1 - - -

Getting Started

Citation

@article{yang2024daocc,
  title={DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction},
  author={Yang, Zhen and Dong, Yanpeng and Wang, Heng},
  journal={arXiv preprint arXiv:2409.19972},
  year={2024}
}

Acknowledgements

Many thanks to these excellent open-source projects: