
MV-SAM3D

MV-SAM3D is a multi-view 3D reconstruction framework that extends SAM 3D Objects to leverage observations from multiple viewpoints. It supports both single-object and multi-object generation, and is designed to produce more stable geometry, texture, and scene-level consistency.

Paper

Installation

Please follow the environment setup from:

Data Format

scene/
├── images/
│   ├── 0.png
│   ├── 1.png
│   └── ...
├── object_a/
│   ├── 0.png
│   ├── 1.png
│   └── ...
├── object_b/
│   └── ...
└── ...

Each object directory contains one mask per view, named to match the files in images/. Mask files are RGBA PNGs whose alpha channel marks the foreground.
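The alpha-channel convention can be read back with a few lines of NumPy and Pillow. This is an illustrative sketch, not repository code; the helper name and the 128 threshold are assumptions:

```python
import numpy as np
from PIL import Image

def load_foreground_mask(path, threshold=128):
    """Load an RGBA mask PNG and return a boolean foreground mask
    derived from its alpha channel (alpha >= threshold -> foreground)."""
    img = Image.open(path).convert("RGBA")
    alpha = np.asarray(img)[..., 3]
    return alpha >= threshold
```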

Results Comparison

Single-object

| Single-View (View 3) | Single-View (View 6) | MV-SAM3D |
| --- | --- | --- |
| input image | input image | input images |
| Single-view baseline. | Single-view baseline. | Better multi-view consistency. |

(Reconstruction images are shown in the repository.)

Multi-object

| SAM 3D (single-view) | MV-SAM3D w/o Pose Optimization | MV-SAM3D (full) |
| --- | --- | --- |
| Shape and pose are often unstable. | Multi-view improves object quality. | Improved overall scene alignment. |

Quick Start

Single-object inference

python run_inference_weighted.py \
  --input_path ./data/example \
  --mask_prompt stuffed_toy \
  --da3_output ./da3_outputs/example/da3_output.npz

Multi-object inference

python run_inference_weighted.py \
  --input_path ./data/desk_objects0 \
  --mask_prompt keyboard,speaker,mug,stuffed_toy \
  --da3_output ./da3_outputs/desk_objects0/da3_output.npz \
  --merge_da3_glb \
  --run_pose_optimization

Default Settings (No Extra Flags)

For single-object inference (run_inference_weighted.py), key defaults are:

  • Stage 1 weighting: enabled (stage1_entropy_alpha=30.0)
  • Stage 2 weighting: enabled (stage2_weight_source=entropy)
  • Stage 2 alpha defaults: stage2_entropy_alpha=30.0, stage2_visibility_alpha=30.0
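The alpha values above control how sharply per-view weights fall off with uncertainty. As a rough illustration of what an entropy-based weighting can look like (this is a hedged sketch, not the repository's actual implementation), lower-entropy views can be given exponentially larger fusion weights:

```python
import numpy as np

def entropy_weights(entropies, alpha=30.0):
    """Turn per-view entropy scores into normalized fusion weights:
    lower entropy (a more confident view) receives a higher weight."""
    e = np.asarray(entropies, dtype=np.float64)
    logits = -alpha * (e - e.min())  # shift by the minimum for numerical stability
    w = np.exp(logits)
    return w / w.sum()
```

With a large alpha such as the default 30.0, the weighting is close to a hard selection of the most confident view; a small alpha approaches uniform averaging.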

Preprocessing for a New Scene

python preprocessing/build_mvsam3d_dataset.py \
  --input data/your_scene \
  --objects keyboard,speaker,mug,stuffed_toy
python scripts/run_da3.py \
  --image_dir ./data/your_scene/images \
  --output_dir ./da3_outputs/your_scene
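Before running inference, the produced .npz can be sanity-checked. The exact keys depend on the Depth Anything 3 export, so the following is a generic inspection helper, not part of the repository:

```python
import numpy as np

def summarize_npz(path):
    """Return {array name: (shape, dtype string)} for every array in an .npz file."""
    data = np.load(path)
    return {key: (data[key].shape, str(data[key].dtype)) for key in data.files}

# Example (path from the preprocessing step above):
# summarize_npz("./da3_outputs/your_scene/da3_output.npz")
```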

Citation

@article{li2026mv,
  title={MV-SAM3D: Adaptive Multi-View Fusion for Layout-Aware 3D Generation},
  author={Li, Baicheng and Wu, Dong and Li, Jun and Zhou, Shunkai and Zeng, Zecui and Li, Lusong and Zha, Hongbin},
  journal={arXiv preprint arXiv:2603.11633},
  year={2026}
}

Acknowledgments

We thank the authors of SAM 3D Objects and Depth Anything 3 for their excellent work.

License

Please refer to LICENSE for usage terms.
