-
Notifications
You must be signed in to change notification settings - Fork 5
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Kadir Yilmaz
committed
Apr 10, 2024
1 parent
d0824db
commit 1025112
Showing
75 changed files
with
129 additions
and
6,622 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,4 +1,4 @@ | ||
# MASK4D: Mask Transformer for 4D Panoptic Segmentation | ||
# Mask4Former: Mask Transformer for 4D Panoptic Segmentation (Renamed from MASK4D) | ||
<div align="center"> | ||
<a href="https://github.com/YilmazKadir/">Kadir Yilmaz</a>, | ||
<a href="https://jonasschult.github.io/">Jonas Schult</a>, | ||
|
@@ -7,7 +7,7 @@ | |
|
||
RWTH Aachen University | ||
|
||
MASK4D is a transformer-based model for 4D Panoptic Segmentation, achieving a new state-of-the-art performance on the SemanticKITTI test set. | ||
Mask4Former is a transformer-based model for 4D Panoptic Segmentation, achieving a new state-of-the-art performance on the SemanticKITTI test set. | ||
|
||
<a href="https://pytorch.org/get-started/locally/"><img alt="PyTorch" src="https://img.shields.io/badge/PyTorch-ee4c2c?logo=pytorch&logoColor=white"></a> | ||
<a href="https://pytorchlightning.ai/"><img alt="Lightning" src="https://img.shields.io/badge/-Lightning-792ee5?logo=pytorchlightning&logoColor=white"></a> | ||
|
@@ -19,11 +19,12 @@ MASK4D is a transformer-based model for 4D Panoptic Segmentation, achieving a ne | |
</div> | ||
<br><br> | ||
|
||
[[Project Webpage](https://vision.rwth-aachen.de/mask4d)] [[arXiv](https://arxiv.org/abs/2309.16133)] | ||
[[Project Webpage](https://vision.rwth-aachen.de/Mask4Former)] [[arXiv](https://arxiv.org/abs/2309.16133)] | ||
|
||
## News | ||
* **2023-01-29**: Mask4Former accepted to ICRA 2024 | ||
|
||
* **2023-09-28**: Paper on arXiv | ||
* **2023-09-28**: Mask4Former on arXiv | ||
|
||
### Dependencies | ||
The main dependencies of the project are the following: | ||
|
@@ -33,23 +34,17 @@ cuda: 11.7 | |
``` | ||
You can set up a conda environment as follows | ||
``` | ||
conda create --name mask4d python=3.8 | ||
conda activate mask4d | ||
pip install -r requirements.txt | ||
conda create --name mask4former python=3.8 | ||
conda activate mask4former | ||
|
||
pip install torch==1.13.0+cu117 torchvision==0.14.0+cu117 torchaudio==0.13.0 --extra-index-url https://download.pytorch.org/whl/cu117 | ||
pip install torch==1.13.0+cu117 torchvision==0.14.0+cu117 --extra-index-url https://download.pytorch.org/whl/cu117 | ||
|
||
pip install torch-scatter -f https://data.pyg.org/whl/torch-1.13.0+cu117.html | ||
pip install -r requirements.txt --no-deps | ||
|
||
pip install 'git+https://github.com/facebookresearch/detectron2.git@710e7795d0eeadf9def0e7ef957eea13532e34cf' --no-deps | ||
pip install git+https://github.com/NVIDIA/MinkowskiEngine.git -v --no-deps | ||
|
||
cd third_party/pointnet2 && python setup.py install | ||
pip install git+https://github.com/facebookresearch/[email protected] --no-deps | ||
|
||
cd .. | ||
git clone https://github.com/NVIDIA/MinkowskiEngine.git | ||
cd MinkowskiEngine | ||
python setup.py install | ||
cd ../.. | ||
``` | ||
|
||
### Data preprocessing | ||
|
@@ -66,7 +61,7 @@ python -m datasets.preprocessing.semantic_kitti_preprocessing make_instance_data | |
``` | ||
|
||
### Training and testing | ||
Train MASK4D: | ||
Train Mask4Former: | ||
```bash | ||
python main_panoptic.py | ||
``` | ||
|
@@ -86,16 +81,16 @@ general.ckpt_path='PATH_TO_CHECKPOINT.ckpt' \ | |
general.dbscan_eps=1.0 | ||
``` | ||
## Trained checkpoint | ||
[MASK4D](https://omnomnom.vision.rwth-aachen.de/data/mask4d/mask4d.ckpt) | ||
[Mask4Former](https://omnomnom.vision.rwth-aachen.de/data/mask4former/mask4former.ckpt) | ||
|
||
The provided model, trained after the submission, achieves 71.1 LSTQ without DBSCAN and 71.5 with DBSCAN post-processing. | ||
|
||
## BibTeX | ||
``` | ||
@article{yilmaz2023mask4d, | ||
title = {{MASK4D: Mask Transformer for 4D Panoptic Segmentation}}, | ||
@inproceedings{yilmaz24mask4former, | ||
title = {{Mask4Former: Mask Transformer for 4D Panoptic Segmentation}}, | ||
author = {Yilmaz, Kadir and Schult, Jonas and Nekrasov, Alexey and Leibe, Bastian}, | ||
journal = {arXiv prepring arXiv:2309.16133}, | ||
year = {2023} | ||
booktitle = {{International Conference on Robotics and Automation (ICRA)}}, | ||
year = {2024} | ||
} | ||
``` |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,5 +1,5 @@ | ||
# @package _group_ | ||
_target_: models.Mask4D | ||
_target_: models.Mask4Former | ||
|
||
# backbone | ||
backbone: | ||
|
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,24 +1,96 @@ | ||
absl-py==2.1.0 | ||
aiohttp==3.9.3 | ||
aiosignal==1.3.1 | ||
antlr4-python3-runtime==4.8 | ||
async-timeout==4.0.3 | ||
attrs==23.2.0 | ||
black==23.3.0 | ||
cachetools==5.3.3 | ||
certifi==2024.2.2 | ||
charset-normalizer==3.3.2 | ||
click==8.1.7 | ||
docker-pycreds==0.4.0 | ||
filelock==3.13.4 | ||
fire==0.5.0 | ||
frozenlist==1.4.1 | ||
fsspec==2024.3.1 | ||
fvcore==0.1.5.post20221221 | ||
gitdb==4.0.11 | ||
GitPython==3.1.43 | ||
google-auth==2.29.0 | ||
google-auth-oauthlib==1.0.0 | ||
grpcio==1.62.1 | ||
hydra-core==1.0.5 | ||
omegaconf==2.0.6 | ||
python-dotenv==0.20.0 | ||
plyfile==0.7.4 | ||
trimesh==3.14.0 | ||
loguru==0.6.0 | ||
wandb==0.13.2 | ||
fvcore==0.1.5.post20220512 | ||
cloudpickle==2.1.0 | ||
albumentations==1.2.1 | ||
volumentations==0.1.8 | ||
matplotlib==3.5.3 | ||
pyviz3d==0.2.28 | ||
idna==3.6 | ||
importlib-metadata==3.10.1 | ||
tensorboard==2.10.0 | ||
importlib_resources==6.4.0 | ||
iopath==0.1.10 | ||
Jinja2==3.1.3 | ||
joblib==1.4.0 | ||
loguru==0.6.0 | ||
Markdown==3.3.4 | ||
MarkupSafe==2.1.5 | ||
mpmath==1.3.0 | ||
multidict==6.0.5 | ||
mypy-extensions==1.0.0 | ||
natsort==8.3.1 | ||
networkx==3.1 | ||
ninja==1.11.1 | ||
numpy==1.24.4 | ||
nvidia-cublas-cu12==12.1.3.1 | ||
nvidia-cuda-cupti-cu12==12.1.105 | ||
nvidia-cuda-nvrtc-cu12==12.1.105 | ||
nvidia-cuda-runtime-cu12==12.1.105 | ||
nvidia-cudnn-cu12==8.9.2.26 | ||
nvidia-cufft-cu12==11.0.2.54 | ||
nvidia-curand-cu12==10.3.2.106 | ||
nvidia-cusolver-cu12==11.4.5.107 | ||
nvidia-cusparse-cu12==12.1.0.106 | ||
nvidia-nccl-cu12==2.19.3 | ||
nvidia-nvjitlink-cu12==12.4.127 | ||
nvidia-nvtx-cu12==12.1.105 | ||
oauthlib==3.2.2 | ||
omegaconf==2.0.6 | ||
packaging==24.0 | ||
pathspec==0.12.1 | ||
pathtools==0.1.2 | ||
pillow==10.3.0 | ||
platformdirs==4.2.0 | ||
portalocker==2.8.2 | ||
promise==2.3 | ||
protobuf==3.20.3 | ||
psutil==5.9.8 | ||
pyasn1==0.6.0 | ||
pyasn1_modules==0.4.0 | ||
pyDeprecate==0.3.2 | ||
antlr4-python3-runtime==4.8 | ||
black==23.3.0 | ||
python-dotenv==0.20.0 | ||
pytorch-lightning==1.7.2 | ||
ninja==1.11.1 | ||
wheel==0.38.4 | ||
PyYAML==5.4.1 | ||
requests==2.31.0 | ||
requests-oauthlib==2.0.0 | ||
rsa==4.9 | ||
scikit-learn==1.3.2 | ||
scipy==1.10.1 | ||
sentry-sdk==1.45.0 | ||
setproctitle==1.3.3 | ||
shortuuid==1.0.13 | ||
six==1.16.0 | ||
smmap==5.0.1 | ||
sympy==1.12 | ||
tabulate==0.9.0 | ||
tensorboard==2.14.0 | ||
tensorboard-data-server==0.7.2 | ||
termcolor==2.4.0 | ||
threadpoolctl==3.4.0 | ||
tomli==2.0.1 | ||
torchmetrics==0.11.4 | ||
natsort==8.3.1 | ||
fire==0.5.0 | ||
tqdm==4.66.2 | ||
triton==2.2.0 | ||
typing_extensions==4.11.0 | ||
urllib3==2.2.1 | ||
volumentations==0.1.8 | ||
wandb==0.13.2 | ||
Werkzeug==3.0.2 | ||
yacs==0.1.8 | ||
yarl==1.9.4 | ||
zipp==3.18.1 |
This file was deleted.
Oops, something went wrong.
Oops, something went wrong.