INT: Towards Infinite-frames 3D Detection with An Efficient Framework

It is natural to construct a multi-frame instead of a single-frame 3D detector for a continuous-time stream. Although increasing the number of frames might improve performance, previous multi-frame studies only used very limited frames to build their systems due to the dramatically increased computational and memory cost. To address these issues, we propose a novel on-stream training and prediction framework that, in theory, can employ an infinite number of frames while keeping the same amount of computation as a single-frame detector. This infinite framework (INT), which can be used with most existing detectors, is utilized, for example, on the popular CenterPoint, with significant latency reductions and performance improvements. We've also conducted extensive experiments on two large-scale datasets, nuScenes and Waymo Open Dataset, to demonstrate the scheme's effectiveness and efficiency. By employing INT on CenterPoint, we can get around 7% (Waymo) and 15% (nuScenes) performance boost with only 2~4ms latency overhead, and currently SOTA on the Waymo 3D Detection leaderboard.

INT: Towards Infinite-frames 3D Detection with An Efficient Framework,
ver.ECCV2022 | ver.Arxiv(more details)

@article{xu2022int,
  title={INT: Towards Infinite-frames 3D Detection with An Efficient Framework},
  author={Xu, Jianyun and Miao, Zhenwei and et al.},
  journal={ECCV},
  year={2022},
}

Highlights

Simple and Fast: INT is an on-stream multi-frame system made up of Memory Bank and Dynamic Training Sequence Length that can theoretically be trained and predicted using infinite frames while consuming similar computation and memory as a single-frame system.
SOTA: Our 100-frames INT is currently among SOTAs (w/o ensemble) on Waymo 3D Detection leaderboard.
Extensible: INT can be employed on most detectors, even for other tasks, like segmentation.

Main results

3D detection on Waymo val set

	#Frame	Veh_L2	Ped_L2	Cyc_L2	MAPH	latency(ms)
INT-1s	2	69.4	69.1	72.6	70.3	74.0
INT-1s	10	72.2	72.1	75.3	73.2	74.0
INT-2s	2	70.8	68.7	73.1	70.8	78.9
INT-2s	10	73.3	71.9	75.6	73.6	78.9

3D detection on Waymo test set

	#Frame	Veh_L2	Ped_L2	Cyc_L2	MAPH	latency(ms)
INT-2s	10	76.2	72.8	72.7	73.9	78.9
INT-2s	100	77.6	74.0	74.1	75.2	78.9

1s stands for 1-stage, 2s stands for 2-stage. All results are tested on a GeForce RTX 2070 SUPER with batch size 1.

Use INT

Installation

Please refer to INSTALL to set up libraries needed for distributed training and sparse convolution.

Benchmark Evaluation and Training

Please refer to GETTING_START to prepare the data.

Use configs with "small" tag in configs to reproduce our results. The "big" tag ones bring better results, and you can try it too.

License

INT is release under MIT license (see LICENSE). It is developed based on CenterPoint. Note that both nuScenes and Waymo datasets are under non-commercial licenses.

Acknowlegement

We sincerely thank the following open-source code.

CenterPoint

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
configs/waymo/voxelnet		configs/waymo/voxelnet
det3d		det3d
docs		docs
tools		tools
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.sh		setup.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

INT: Towards Infinite-frames 3D Detection with An Efficient Framework

Highlights

Main results

3D detection on Waymo val set

3D detection on Waymo test set

Use INT

Installation

Benchmark Evaluation and Training

License

Acknowlegement

About

Releases

Packages

Languages

License

ADLab-AutoDrive/INT

Folders and files

Latest commit

History

Repository files navigation

INT: Towards Infinite-frames 3D Detection with An Efficient Framework

Highlights

Main results

3D detection on Waymo val set

3D detection on Waymo test set

Use INT

Installation

Benchmark Evaluation and Training

License

Acknowlegement

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages