https://github.com/cv516buaa/ovgnet
Science Score: 23.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
○DOI references
-
✓Academic publication links
Links to: arxiv.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (7.9%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: cv516Buaa
- Language: Python
- Default Branch: main
- Size: 25.2 MB
Statistics
- Stars: 3
- Watchers: 1
- Forks: 0
- Open Issues: 1
- Releases: 0
Metadata Files
README.md
OVGNet: An Unified Visual-Linguistic Framework for Open-Vocabulary Robotic Grasping
Meng Li
·
Qi Zhao
·
Shuchang Lyu
·
Chunlei Wang
·
Yujing Ma
·
Guangliang Cheng
·
Chenguang Yang
Highlight!!!!
This repo is the implementation of "OVGNet: An Unified Visual-Linguistic Framework for Open-Vocabulary Robotic Grasping". we refer to Vision-Language-Grasping, GroundingDINO, VL-Grasp. Many thanks to these excellent repos.
Demo Setting
- Novel indicates the unseen objects in training.
- Base denotes the seen objects in training.
- Battery and power drill are novel classes, which belong to hard task.
- Apple and pear are base classes, which belong to simple task.
Demo Video
Dataset
- OVGrasping follows GroundingDINO data format.
- The OVGrapsing dataset comprises 117 categories and 63,385 instances.
- Instances are sourced from three distinct origins: RoboRefIt, GraspNet, simulated environment.
- The dataset is divided into two categories: the base category consists 51,857 instances, and the novel category comprises 11,528 instances.
Installation
- Ubantu==18.04
- Python==3.9
- Torch==1.11, Torchvision==0.12.0
- CUDA==11.3
- checkpoint==OVGANet
- assets==assets
please add the assets into OVGNet folder
please ensure the CUDA version is 11.3
conda create -n OVGNet python=3.9
conda activate OVGNet
pip install torch==1.11.0+cu113 torchvision==0.12.0+cu113 torchaudio==0.11.0 --extra-index-url https://download.pytorch.org/whl/cu113
cd /OVGNet/
pip install -r requirments.txt
cd graspnet/graspnet/pointnet2
python setup.py install
cd graspnet/graspnet/knn
python setup.py install
cd groundingdino
pip install -e .
Run
cd /OVGNet/
python test.py --testing_case_dir ./test_cases/simple/apple --pretrain ./checkpoint/OVGANet
Test on OVGrasping
cd /OVGNet/test_vg/
python test_vg.py --c ./config/cfg_odvg.py --datasets ./config/datasets_vg_example.json --pretrain_model_path OVGNet/checkpoint/OVGANet
Cite
``` @InProceedings{Li2024IROS, author = {Li Meng and Zhao Qi and Lyu Shuchang and Wang Chunlei and Ma Yujing and Cheng Guangliang and Yang Chenguang}, title = {OVGNet: A Unified Visual-Linguistic Framework for Open-Vocabulary Robotic Grasping}, year = {2024}, eprint = {2407.13175}, archivePrefix = {arXiv}, primaryClass = {cs.RO}, url = {https://arxiv.org/abs/2407.13175}, }
Owner
- Name: cv516Buaa
- Login: cv516Buaa
- Kind: user
- Location: Beijing,China
- Company: Beihang University
- Repositories: 2
- Profile: https://github.com/cv516Buaa
Pattern Recognition and Artificial Intelligence Group Prof.Qi Zhao & Lijiang Chen Dr. Shuchang Lyu & Binghao Liu & Chunlei Wang
GitHub Events
Total
- Issues event: 2
- Watch event: 8
- Issue comment event: 2
- Fork event: 3
Last Year
- Issues event: 2
- Watch event: 8
- Issue comment event: 2
- Fork event: 3
Dependencies
- Pillow *
- numpy *
- open3d ==0.8
- scipy *
- tensorboard ==2.3
- torch ==1.6
- tqdm *
- addict *
- numpy *
- opencv-python *
- pycocotools *
- supervision *
- timm *
- torch *
- torchvision *
- transformers *
- yapf *
- Pillow ==9.2.0
- PyYAML ==6.0
- Requests ==2.31.0
- addict ==2.4.0
- fiftyone ==0.23.5
- ftfy ==6.1.3
- graspnetAPI ==1.2.10
- huggingface_hub ==0.17.3
- ipdb ==0.13.13
- jsonlines ==4.0.0
- matplotlib ==3.3.3
- numpy ==1.23.5
- open3d ==0.15.2
- open3d_plus ==0.1.0
- opencv_python ==4.4.0.46
- pybullet ==3.0.7
- regex ==2023.10.3
- scipy ==1.12.0
- setuptools ==68.0.0
- supervision ==0.6.0
- termcolor ==1.1.0
- timm ==0.9.8
- tqdm ==4.57.0
- transformers ==4.34.1
- typer ==0.9.0
- yapf ==0.40.1
- addict * test
- colorlog * test
- cython * test
- numpy * test
- opencv-python * test
- pycocotools * test
- pyyaml >3.10 test
- scipy * test
- submitit * test
- supervision ==0.6.0 test
- termcolor * test
- timm * test
- torch * test
- torchvision * test
- transformers * test
- yapf ==0.40.1 test