This Project is based on https://github.com/drprojects/DeepViewAgg
The following must be installed before installing this project.
- Anaconda3
- cuda >= 11.6
- gcc >= 7
All remaining dependencies (PyTorch, PyTorch Geometric, etc) should be installed using the provided installation script. For using the point transformer layer follow instruction in https://github.com/POSTECH-CVLab/point-transformer
The code has been tested in the following environment:
- Ubuntu 20.04.5 LTS
- Python 3.7.9
- PyTorch 1.12.0
- CUDA 11.6
- NVIDIA GeForce RTX 3080 10G
- 64G RAM
This is not the official Torch-Points3D framework. This work builds on and modifies a fixed version of the framework and has not been merged with the official repository yet. In particular, this repository introduces numerous features for multimodal learning on large-scale 3D point clouds. In this repository, some TP3D-specific files were removed for simplicity.
The project follows the original Torch-Points3D framework structure.
├─ conf # All configurations live there
├─ notebooks # Notebooks to get started with multimodal datasets and models
├─ eval.py # Eval script
├─ insall.sh # Installation script for DeepViewAgg
├─ scripts # Some scripts to help manage the project
├─ torch_points3d
├─ core # Core components
├─ datasets # All code related to datasets
├─ metrics # All metrics and trackers
├─ models # All models
├─ modules # Basic modules that can be used in a modular way
├─ utils # Various utils
└─ visualization # Visualization
└─ train.py # Main script to launch a trainingThe models we added can be found under conf/models/segmentation/multimodal/adl4cv-scannet.yaml
Scripts to train
scripts/train_kitti360.shscripts/train_s3dis.shscripts/train_scannet.sh
For more information on the project check https://github.com/drprojects/DeepViewAgg for detailed