Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (14.3%) to scientific vocabulary
Last synced: 10 months ago · JSON representation ·

Repository

Basic Info
  • Host: GitHub
  • Owner: Anon-BOTs
  • License: apache-2.0
  • Language: Python
  • Default Branch: main
  • Size: 11.9 MB
Statistics
  • Stars: 1
  • Watchers: 1
  • Forks: 0
  • Open Issues: 1
  • Releases: 0
Created over 1 year ago · Last pushed over 1 year ago
Metadata Files
Readme License Citation

README.md

# GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving Authors

Introduction

💥GussianPretrain introduces 3D Gaussian Splatting technology into vision pre-training task for the first time. Which demonstrates remarkable effectiveness and robustness, achieving significant improvements across various 3D perception tasks, including 3D object detection, HD map reconstruction, and occupancy prediction, with efficiency and lower memory consumption.💥

pipeline

Qualitative Rendered Visualization

Image and Video DEMO

Rendered Image Visualization.

pipeline

Framework Modules Analysis and Rendered Video Visualization.

https://github.com/user-attachments/assets/4d77ea40-0567-461e-94f5-f499198f7f8a

News

  • [2025-03-05] 🚀 We incorporate our method with LiDAR modality.
  • [2025-01-31] 🚀 Complement rendered visualization images and video for better clear the reconstruction performance of our approach.
  • [2025-01-01] 💥 The experiments setting of UVTR-CS config and weight also released which not achieved in the paper.

  • [2025-01-01] 🚀 The complete code and associated weights have been released. By the way, Happy New Year to everyone! 💥.

  • [2024-11-20] The codebase is initialed. We are diligently preparing for a clean, optimized version. Stay tuned for the complete code release, which is coming soon..

Overview

💥The architecture of proposed GaussianPretrain. Given multi-view images, we first extract valid mask patches using the mask generator with the LiDAR Depth Guidance strategy. Subsequently, a set of learnable 3D Gaussian anchors is generated using ray-based guidance and conceptualized as volumetric LiDAR points. Finally, the reconstruction signals of RGB, Depth, and Occupancy are decoded based on the predicted Gaussian anchor parameters.

pipeline

Main Results

3D Object Detection

img.png

HD-Map Reconstruction

img.png

Occupancy Predict

img.png

Getting Started

Installation

This project is based on MMDetection3D, which can be constructed as follows.

``` conda create -n gaussianpretrain python=3.8 conda activate gaussianpretrain conda install pytorch==1.9.1 torchvision==0.10.1 torchaudio==0.9.1 cudatoolkit=11.1 -c pytorch -c conda-forge

pip install mmcv-full==1.3.11 -f https://download.openmmlab.com/mmcv/dist/cu111/torch1.9/index.html pip install mmdet==2.14.0 mmsegmentation==0.14.1 tifffile-2021.11.2 numpy==1.19.5 protobuf==3.19.4 scikit-image==0.19.2 pycocotools==2.0.0 nuscenes-devkit==1.0.5 spconv-cu111 gpustat numba scipy pandas matplotlib Cython shapely loguru tqdm future fire yacs jupyterlab scikit-image pybind11 tensorboardX tensorboard easydict pyyaml open3d addict pyquaternion awscli timm typing-extensions==4.7.1

cd GaussianPretrain python setup.py develop cd projects/mmdet3d_plugin/ops/diff-gaussian-rasterization python setup.py develop ```

Data Preparation

Please follow the instruction of UVTR and PanoOCC to prepare the dataset.

Training & Testing

You can train and eval the model following the instructions. For example: ```

run gaussian pretrain on 8 GPUS

bash tools/disttrain.sh projects/mmdet3dplugin/configs/gaussianpretrain/gp0.075convnext.py 8

run downstream task ft on 8 GPUS

bash tools/disttrain.sh projects/mmdet3dplugin/configs/gaussianpretrain/uvtrdnft.py 8

run eval

python tools/test.py $config $ckpt --eval bbox ```

Weights

1. Object Detection

| Method | Pretrained ckpt | Config | NDS | mAP | Model | |---------------|-----|--------------|-------|------|-------- | UVTR-C+GP | Pretrained |UVTR-C | 47.2 | 41.7 | Google | UVTR-C+GP | Pretrained |UVTR-CS | 50.0 | 42.3 | Google

2. HD-Map Reconstruction

| Method | Pretrained ckpt | Config | mAP | Model |--------------------|----------------|--------|---------|-------- | MapTR-tiny†+GP | Pretrained |MapTR-tiny | 42.42 | Google

3. Occupancy Predict

| Method | Pretrained ckpt | Config | mIoU | Model | |--------|-----------------------------------------------------------------------------------------------------|---------------|---------|---------------- |BEVFormerOCC+GP| Pretrained | BEVFormerOCC | 24.21 | Google |PanoOCC+GP| Pretrained | PanoOCC | 42.62 | Google

TODO

  • streampetr version will publish soon.
  • Project Page.

Acknowledgement

This project is mainly based on the following codebases. Thanks for their great works! - UVTR - UniPAD - MMDetection3D

Owner

  • Name: Anon-BOTs
  • Login: Anon-BOTs
  • Kind: user

Anonymous repos for reproducing the performance of the code in the manuscript paper.

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - name: "MMDetection3D Contributors"
title: "OpenMMLab's Next-generation Platform for General 3D Object Detection"
date-released: 2020-07-23
url: "https://github.com/open-mmlab/mmdetection3d"
license: Apache-2.0

GitHub Events

Total
  • Issues event: 2
  • Watch event: 10
  • Push event: 15
Last Year
  • Issues event: 2
  • Watch event: 10
  • Push event: 15

Issues and Pull Requests

Last synced: 10 months ago

All Time
  • Total issues: 2
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 2
  • Total pull request authors: 0
  • Average comments per issue: 0.0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 2
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 2
  • Pull request authors: 0
  • Average comments per issue: 0.0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • Anon-BOTs (1)
  • pengxuanyang (1)
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels