open3dtrack
Code for Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking
Science Score: 54.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
✓Academic publication links
Links to: arxiv.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (11.3%) to scientific vocabulary
Repository
Code for Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking
Basic Info
Statistics
- Stars: 24
- Watchers: 1
- Forks: 1
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking
🚀 Paper Accepted to ICRA 2025! 🎉
<!--
-->
Abstract
3D multi-object tracking plays a critical role in autonomous driving by enabling the real-time monitoring and prediction of multiple objects’ movements. Traditional 3D tracking systems are typically constrained by predefined object categories, limiting their adaptability to novel, unseen objects in dynamic environments. To address this limitation, we introduce open-vocabulary 3D tracking, which extends the scope of 3D tracking to include objects beyond predefined categories. We formulate the problem of open-vocabulary 3D tracking and introduce dataset splits designed to represent various open-vocabulary scenarios. We propose a novel approach that integrates open-vocabulary capabilities into a 3D tracking framework, allowing for generalization to unseen object classes. Our method effectively reduces the performance gap between tracking known and novel objects through strategic adaptation. Experimental results demonstrate the robustness and adaptability of our method in diverse outdoor driving scenarios. To the best of our knowledge, this work is the first to address open-vocabulary 3D tracking, presenting a significant advancement for autonomous systems in real-world settings.
Getting Started
Clone the repository and submodules:
bash
git clone --recurse-submodules https://github.com/ayesha-ishaq/Open3DTrack
Environment Setup
Create and activate the Conda environment:
bash conda create -n open3dtrack python=3.10 conda activate open3dtrackInstall dependencies:
bash pip install ultralytics conda install pytorch==1.13.1 torchvision==0.14.1 torchaudio==0.13.1 pytorch-cuda=11.6 -c pytorch -c nvidia pip install nuscenes-devkit matplotlib pandas motmetrics==1.2.0 conda install pyg -c pyg pip install pyg_lib torch_scatter torch_sparse torch_cluster torch_spline_conv -f https://data.pyg.org/whl/torch-1.13.0+cu116.htmlInstall third-party dependencies:
Refer to this repository for additional installations.
Dataset Preparation
- Download NuScenes dataset: NuScenes (Keyframes only)
Generate YOLOWorld Detections: Set the
$dataset_dirinyoloworld.pyto$output_dirtobash python yoloworld.py3D Detector Detections:
- CenterPoint Detection
- Megvii Detection
- BEVFusion Detection (To obtain BEVFusion results, you must install BEVFusion and run inference. Alternatively, you can use BEVFusion from mmdetection3d)
Rename the files as train.json and val.json and save them at
- Generate Split Data:
To preprocess 3D and 2D detections and obtain data for 3D tracker run:
Split scenarios are as proposed in the paper: rare, urban, and diverse. Set
to one of these. bash python generate_data_yoloworld.py \ --dataset_dir <path_to_nuscenes> \ --detection_dir <path_to_3D_detections> \ --output_dir <path_to_output> \ --yoloworld_dir <path_to_yoloworld_detections> \ --data_split_scenario <split_name> --apply_nms
Training and Evaluation
Update Config File:
Set paths for dataset and processed data inconfig/default.json, and set fieldsplitto the desired scenario (rare, urban, diverse)Training:
bash python train.py -c config/default.jsonEvaluation:
bash python train.py -c config/default.json -r <path_to_checkpoint> --eval_only -o <path_to_result_folder>
Experimental Results
| Split | AMOTA | AMOTP | Bicycle | Bus | Car | Motorcycle | Pedestrian | Trailer | Truck | Checkpoint| |-------|-------|-------|---------|------|------|------------|------------|---------|-------|-------| | Rare | 0.578 | 0.783 | 0.445 |0.612| 0.779| 0.469 | 0.752 | 0.477 | 0.511 | weights | Urban | 0.590 | 0.677 | 0.400 | 0.683| 0.788| 0.702 | 0.548 | 0.488 | 0.522 | weights | Diverse | 0.536| 0.804 | 0.524 | 0.770| 0.708| 0.438 | 0.564 | 0.470 | 0.276 |weights | UpperBound (3DMOTFormer) | 0.710 | 0.521 | 0.545 | 0.853 | 0.838 | 0.723 | 0.812 | 0.509 | 0.690 |
AMOTA and AMOTP show overall results, while only AMOTA results are shown for each class here. Bold values indicate novel classes of that split.
Acknowledgements
We would like to thank the following contributors and organizations for their support and resources:
- 3DMOTFormer
- [NuScenes](https://www.nuscenes.o]rg/)**
- Ultralytics
- All collaborators and contributors who have helped shape this project.
Owner
- Name: Ayesha Ishaq
- Login: ayesha-ishaq
- Kind: user
- Repositories: 1
- Profile: https://github.com/ayesha-ishaq
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: Ishaq
given-names: Ayesha
orcid: https://orcid.org/0009-0004-9167-8622
title: "Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking"
date-released: 2024-09-30
url: "https://github.com/dsx0511/3DMOTFormer"
affiliation: "Mohamed Bin Zayed University of Artificial Intelligence"
GitHub Events
Total
- Issues event: 3
- Watch event: 13
- Issue comment event: 2
- Push event: 1
- Fork event: 1
Last Year
- Issues event: 3
- Watch event: 13
- Issue comment event: 2
- Push event: 1
- Fork event: 1
Issues and Pull Requests
Last synced: 11 months ago
All Time
- Total issues: 1
- Total pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 1
- Total pull request authors: 0
- Average comments per issue: 0.0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 1
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 1
- Pull request authors: 0
- Average comments per issue: 0.0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- Cillian111 (2)
- rlczddl (1)