Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction

🚀 News

2025.08.06: Updated training instructions and pretrained models.
2025.07.25: arXiv preprint released.
2025.06.26: Accepted by ICCV 2025.

Abstract

This work presents SGCDet, a novel multi-view indoor 3D object detection framework based on adaptive 3D volume construction. Unlike previous approaches that restrict the receptive field of voxels to fixed locations on images, we introduce a geometry and context aware aggregation module to integrate geometric and contextual information within adaptive regions in each image and dynamically adjust the contributions from different views, enhancing the representation capability of voxel features. Furthermore, we propose a sparse volume construction strategy that adaptively identifies and selects voxels with high occupancy probabilities for feature refinement, minimizing redundant computation in free space. Benefiting from the above designs, our framework achieves effective and efficient volume construction in an adaptive way. Better still, our network can be supervised using only 3D bounding boxes, eliminating the dependence on ground-truth scene geometry. Experimental results demonstrate that SGCDet achieves state-of-the-art performance on the ScanNet, ScanNet200 and ARKitScenes datasets.
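To make the idea of dynamically adjusting the contributions from different views more concrete, here is a minimal sketch of score-weighted multi-view fusion for a single voxel. It is not the released implementation: the function names `softmax` and `aggregate_views`, the use of one scalar score per view, and the softmax normalization are all assumptions made for illustration. In SGCDet the weighting is learned inside the geometry and context aware aggregation module.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def aggregate_views(view_features, view_scores):
    """Fuse per-view features of one voxel into a single feature vector.

    view_features: list of feature vectors (lists of floats), one per view.
    view_scores:   one scalar per view (hypothetical stand-in for a learned
                   per-view relevance score).
    """
    weights = softmax(view_scores)
    dim = len(view_features[0])
    return [
        sum(w * feat[d] for w, feat in zip(weights, view_features))
        for d in range(dim)
    ]


# Two views with equal scores contribute equally.
fused_equal = aggregate_views([[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0])

# A much higher score lets the first view dominate the fused feature.
fused_skewed = aggregate_views([[1.0, 0.0], [0.0, 1.0]], [100.0, 0.0])
```

With equal scores this reduces to plain averaging across views; unequal scores shift the fused feature toward the views judged more informative.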

Method

Schematics and detailed architectures of SGCDet. (a) Overview of SGCDet, which consists of an image backbone to extract image features, a view transformation module to lift image features to 3D volumes, and a detection head to predict 3D bounding boxes. (b) Details of the coarse-to-fine refinement in our sparse volume construction strategy. (c) Details of our geometry and context aware aggregation module.
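The sparse, coarse-to-fine idea in (b) can be illustrated with a short sketch: rank coarse voxels by predicted occupancy, keep only the most likely ones, and refine just their children at the next resolution. This is a simplified illustration, not the repository's code. The names `select_voxels` and `subdivide`, the `keep_ratio` parameter, and the 2x subdivision per axis are assumptions; the actual selection rule and resolutions are defined in the paper and configs.

```python
def select_voxels(occupancy, keep_ratio):
    """Return the coarse voxel coordinates with the highest occupancy.

    occupancy:  dict mapping (x, y, z) -> predicted occupancy probability.
    keep_ratio: fraction of voxels to keep (hypothetical hyperparameter).
    """
    k = max(1, int(len(occupancy) * keep_ratio))
    ranked = sorted(occupancy, key=occupancy.get, reverse=True)
    return ranked[:k]


def subdivide(coords):
    """Map each kept coarse voxel to its 8 children at twice the resolution."""
    return [
        (2 * x + dx, 2 * y + dy, 2 * z + dz)
        for (x, y, z) in coords
        for dx in (0, 1)
        for dy in (0, 1)
        for dz in (0, 1)
    ]


coarse_occupancy = {
    (0, 0, 0): 0.9,
    (0, 0, 1): 0.1,
    (0, 1, 0): 0.8,
    (1, 0, 0): 0.2,
}
kept = select_voxels(coarse_occupancy, keep_ratio=0.5)
fine = subdivide(kept)  # only these voxels would be refined further
```

Under these assumptions, half of the coarse voxels are discarded as likely free space, so the finer level processes 16 voxels instead of 32.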

Getting Started

Step 1. Refer to install.md to set up the environment.

Step 2. Refer to train_and_eval.md for training and evaluation.

Model Zoo

We provide pretrained weights for the ScanNet, ScanNet200, and ARKitScenes datasets, reproduced with the released codebase. The checkpoints can also be downloaded from Hugging Face.

Dataset	Model	mAP@0.25	mAP@0.50	Checkpoint	Config
ScanNet	SGCDet	62.2	36.7	Link	Link
ScanNet200	SGCDet-L	28.9	14.4	Link	Link
ARKitScenes	SGCDet	66.2	54.0	Link	Link
ARKitScenes	SGCDet-L	70.4	57.0	Link	Link
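The suffixes in mAP@0.25 and mAP@0.50 are 3D IoU thresholds: a predicted box counts as a match only if its IoU with a ground-truth box reaches the threshold. The sketch below shows the IoU computation for axis-aligned boxes only. The function name `iou_3d` and the `(xmin, ymin, zmin, xmax, ymax, zmax)` box layout are assumptions for illustration; the repository's evaluation code may use a different box format and also has to handle oriented boxes.

```python
def iou_3d(a, b):
    """IoU of two axis-aligned 3D boxes.

    Each box is (xmin, ymin, zmin, xmax, ymax, zmax); this layout is an
    assumption for the example, not the repository's box format.
    """
    inter = 1.0
    for i in range(3):
        lo = max(a[i], b[i])
        hi = min(a[i + 3], b[i + 3])
        if hi <= lo:
            return 0.0  # no overlap along this axis
        inter *= hi - lo

    def volume(box):
        return (box[3] - box[0]) * (box[4] - box[1]) * (box[5] - box[2])

    return inter / (volume(a) + volume(b) - inter)


pred = (0.0, 0.0, 0.0, 2.0, 2.0, 2.0)
gt = (1.0, 0.0, 0.0, 3.0, 2.0, 2.0)
iou = iou_3d(pred, gt)

matches_at_025 = iou >= 0.25
matches_at_050 = iou >= 0.50
```

In this example the two boxes overlap by one third, so the prediction counts as correct at the 0.25 threshold but not at 0.50, which is why mAP@0.50 is the stricter of the two metrics.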

License

This project is released under the Apache 2.0 license.

Contact

If you have any questions or run into problems, feel free to open an issue or contact Runmin Zhang (runmin_zhang@zju.edu.cn).

Acknowledgement

Many thanks to these exceptional open source projects:

It is not possible to list every project from the referenced papers. If we have left out your repo, please contact us and we will update the list.

Bibtex

If you find our work beneficial for your research, please consider citing our paper and giving us a star:

@inproceedings{SGCDet,
 title = {Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction},
 author = {Zhang, Runmin and Yu, Zhu and Cao, Si-Yuan and Zhu, Lingyu and Zhang, Guangyi and Bai, Xiaokai and Shen, Hui-Liang},
 booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision},
 year = {2025}
}
