https://github.com/bravegroup/pointsam-for-mixsup

Codes for ICLR 2024: "MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection"

Science Score: 23.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
○
.zenodo.json file
○
DOI references
✓
Academic publication links
Links to: arxiv.org, mdpi.com, ieee.org
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (6.8%) to scientific vocabulary

Last synced: 10 months ago · JSON representation

Repository

Codes for ICLR 2024: "MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection"

Basic Info

Host: GitHub
Owner: BraveGroup
License: apache-2.0
Language: Python
Default Branch: main
Homepage: https://arxiv.org/abs/2401.16305
Size: 4.16 MB

Statistics

Stars: 64
Watchers: 4
Forks: 6
Open Issues: 0
Releases: 0

Created over 2 years ago · Last pushed almost 2 years ago

Metadata Files

Readme License

PointSAM-for-MixSup

MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection (ICLR 2024)

Yuxue Yang, Lue Fan†, Zhaoxiang Zhang† (†: Corresponding Authors)

[ :bookmark_tabs: Paper ] [ :octocat: GitHub Repo ] [ :paperclip: BibTeX ]

Teaser Figure

A good LiDAR-based detector needs massive semantic labels for difficult semantic learning but only a few accurate labels for geometry estimation.

MixSup is a practical and universality paradigm for label-efficient LiDAR-based 3D object detection, simultaneously utilizing cheap coarse labels and limited accurate labels.
MixSup achieves up to 97.31% of fully supervised performance with cheap cluster-level labels and only 10% box-level labels, which has been validated in nuScenes, Waymo Open Dataset, and KITTI.
MixSup can seamlessly integrate with various 3D detectors, such as SECOND, CenterPoint, PV-RCNN, and FSD.

PointSAM is a simple and effective method for MixSup to automatically segment cluster-level labels, further reducing the annotation burden.
PointSAM is on par with the recent fully supervised panoptic segmentation models for thing classes on nuScenes without any 3D annotations!

:raising_hand: Talk is cheap, show me the samples!

:star2: Panoptic segmentation performance for thing classes on nuScenes validation split

| Methods | PQ^Th | SQ^Th | RQ^Th | | :-----: | :-------------: | :-------------: | :-------------: | | GP-S3Net | 56.0 | 85.3 | 65.2 | | SMAC-Seg | 65.2 | 87.1 | 74.2 | |Panoptic-PolarNet | 59.2 | 84.1 | 70.3 | |SCAN | 60.6 | 85.7 | 70.2 | |PointSAM (Ours) | 63.7 | 82.6 | 76.9 |

Installation

PointSAM

Step 1. Create a conda environment and activate it.

shell conda create --name MixSup python=3.8 -y conda activate MixSup

Step 2. Install PyTorch following official instructions. The codes are tested on PyTorch 1.9.1, CUDA 11.1.

shell pip install torch==1.9.1+cu111 torchvision==0.10.1+cu111 -f https://download.pytorch.org/whl/torch_stable.html

Step 3. Install Segment Anything and torch_scatter.

shell pip install git+https://github.com/facebookresearch/segment-anything.git pip install https://data.pyg.org/whl/torch-1.9.0%2Bcu111/torch_scatter-2.0.9-cp38-cp38-linux_x86_64.whl

Step 4. Install other dependencies.

shell pip install -r requirements.txt

Dataset Preparation

nuScenes

Download nuScenes Full dataset and nuScenes-panoptic (for evaluation) from the official website, then extract and organize the data ito the following structure:

shell PointSAM-for-MixSup └── data └── nuscenes ├── maps ├── panoptic ├── samples ├── sweeps └── v1.0-trainval

Note: v1.0-trainval/category.json and v1.0-trainval/panoptic.json in nuScenes-panoptic will replace the original v1.0-trainval/category.json and v1.0-trainval/panoptic.json of the Full dataset.

Getting Started

First download the model checkpoints, then run the following commands to reproduce the results in the paper:

```shell

single-gpu

bash run.sh

multi-gpu

bash run_dist.sh ```

Note: 1. The default setting for run_dist.sh is to use 8 GPUs. If you want to use less GPUs, please modify the NUM_GPUS argument in run_dist.sh. 2. You can specify the SAMPLE_INDICES between scripts/indices_train.npy and scripts/indices_val.npy to run PointSAM on train or val split of nuScenes. The default setting is to segment the val split and evaluate the results on panoptic segmentation task. 3. Before running the scripts, please make sure that you have at least 850MB of free space in the OUT_DIR folder for val split and 4GB for train split. 4. segment3D.py is the main script for PointSAM. The argument --for_eval is used to generate labels with the same format as nuScenes-panoptic for evaluation, which is not necessary for MixSup. If you just want to utilize PointSAM for MixSup, please remove --for_eval in run.sh or run_dist.sh. We also provide a script to convert the labels generated by PointSAM between the .npz format for nuScenes-panoptic evaluation and .bin format for MixSup.

Model Checkpoints

We adopt ViT-H SAM as the segmentation model for PointSAM and utilize nuImages pre-trained HTC to integrate semantics for instance masks.

Click the following links to download the model checkpoints and put them in the ckpt/ folder to be consistent with the configuration in configs/cfg_PointSAM.py.

ViT-H SAM: ViT-H SAM model
HTC: HTC model

TODO

[x] Publish the code about PointSAM.
[ ] OpenPCDet based MixSup.
[ ] MMDetection3D based MixSup.

Citation

Please consider citing our work as follows if it is helpful.

@inproceedings{yang2024mixsup, title={MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection}, author={Yang, Yuxue and Fan, Lue and Zhang, Zhaoxiang}, booktitle={ICLR}, year={2024}, }

Acknowledgement

This project is based on the following repositories.

Owner

Name: BraveGroup
Login: BraveGroup
Kind: organization

Repositories: 3
Profile: https://github.com/BraveGroup

GitHub Events

Total

Watch event: 8

Last Year

Watch event: 8

Dependencies

requirements.txt pypi

mmcv_full ==1.7.2
mmdet ==2.28.2
numpy ==1.23.0
opencv_python ==4.9.0.80
pyquaternion ==0.9.9
scipy ==1.10.1
torch ==1.9.1
torchvision ==0.10.1
tqdm *

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science

https://github.com/bravegroup/pointsam-for-mixsup

Science Score: 23.0%

Repository

Basic Info

Statistics

Metadata Files

README.md

PointSAM-for-MixSup

MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection (ICLR 2024)

:raising_hand: Talk is cheap, show me the samples!

:star2: Panoptic segmentation performance for thing classes on nuScenes validation split

Installation

PointSAM

Dataset Preparation

nuScenes

Getting Started

single-gpu

multi-gpu

Model Checkpoints

TODO

Citation

Acknowledgement

Owner

GitHub Events

Total

Last Year

Dependencies