https://github.com/cv516buaa/ovgnet

https://github.com/cv516buaa/ovgnet

Science Score: 23.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: arxiv.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (7.9%) to scientific vocabulary
Last synced: 10 months ago · JSON representation

Repository

Basic Info
  • Host: GitHub
  • Owner: cv516Buaa
  • Language: Python
  • Default Branch: main
  • Size: 25.2 MB
Statistics
  • Stars: 3
  • Watchers: 1
  • Forks: 0
  • Open Issues: 1
  • Releases: 0
Created over 2 years ago · Last pushed almost 2 years ago
Metadata Files
Readme

README.md

OVGNet: An Unified Visual-Linguistic Framework for Open-Vocabulary Robotic Grasping


Meng Li · Qi Zhao · Shuchang Lyu · Chunlei Wang · Yujing Ma · Guangliang Cheng · Chenguang Yang

Highlight!!!!

This repo is the implementation of "OVGNet: An Unified Visual-Linguistic Framework for Open-Vocabulary Robotic Grasping". we refer to Vision-Language-Grasping, GroundingDINO, VL-Grasp. Many thanks to these excellent repos.

Demo Setting

  • Novel indicates the unseen objects in training.
  • Base denotes the seen objects in training.
  • Battery and power drill are novel classes, which belong to hard task.
  • Apple and pear are base classes, which belong to simple task.

Demo Video

Demo

Dataset

  • OVGrasping follows GroundingDINO data format.
  • The OVGrapsing dataset comprises 117 categories and 63,385 instances.
  • Instances are sourced from three distinct origins: RoboRefIt, GraspNet, simulated environment.
  • The dataset is divided into two categories: the base category consists 51,857 instances, and the novel category comprises 11,528 instances.

Installation

  • Ubantu==18.04
  • Python==3.9
  • Torch==1.11, Torchvision==0.12.0
  • CUDA==11.3
  • checkpoint==OVGANet
  • assets==assets

please add the assets into OVGNet folder
please ensure the CUDA version is 11.3 conda create -n OVGNet python=3.9 conda activate OVGNet pip install torch==1.11.0+cu113 torchvision==0.12.0+cu113 torchaudio==0.11.0 --extra-index-url https://download.pytorch.org/whl/cu113

cd /OVGNet/ pip install -r requirments.txt

cd graspnet/graspnet/pointnet2 python setup.py install cd graspnet/graspnet/knn python setup.py install cd groundingdino pip install -e .

Run

cd /OVGNet/ python test.py --testing_case_dir ./test_cases/simple/apple --pretrain ./checkpoint/OVGANet

Test on OVGrasping

cd /OVGNet/test_vg/ python test_vg.py --c ./config/cfg_odvg.py --datasets ./config/datasets_vg_example.json --pretrain_model_path OVGNet/checkpoint/OVGANet

Cite

``` @InProceedings{Li2024IROS, author = {Li Meng and Zhao Qi and Lyu Shuchang and Wang Chunlei and Ma Yujing and Cheng Guangliang and Yang Chenguang}, title = {OVGNet: A Unified Visual-Linguistic Framework for Open-Vocabulary Robotic Grasping}, year = {2024}, eprint = {2407.13175}, archivePrefix = {arXiv}, primaryClass = {cs.RO}, url = {https://arxiv.org/abs/2407.13175}, }

Owner

  • Name: cv516Buaa
  • Login: cv516Buaa
  • Kind: user
  • Location: Beijing,China
  • Company: Beihang University

Pattern Recognition and Artificial Intelligence Group Prof.Qi Zhao & Lijiang Chen Dr. Shuchang Lyu & Binghao Liu & Chunlei Wang

GitHub Events

Total
  • Issues event: 2
  • Watch event: 8
  • Issue comment event: 2
  • Fork event: 3
Last Year
  • Issues event: 2
  • Watch event: 8
  • Issue comment event: 2
  • Fork event: 3

Dependencies

graspnet/graspnet/knn/setup.py pypi
graspnet/graspnet/pointnet2/setup.py pypi
graspnet/graspnet/requirements.txt pypi
  • Pillow *
  • numpy *
  • open3d ==0.8
  • scipy *
  • tensorboard ==2.3
  • torch ==1.6
  • tqdm *
groundingdino/requirements.txt pypi
  • addict *
  • numpy *
  • opencv-python *
  • pycocotools *
  • supervision *
  • timm *
  • torch *
  • torchvision *
  • transformers *
  • yapf *
groundingdino/setup.py pypi
requirements.txt pypi
  • Pillow ==9.2.0
  • PyYAML ==6.0
  • Requests ==2.31.0
  • addict ==2.4.0
  • fiftyone ==0.23.5
  • ftfy ==6.1.3
  • graspnetAPI ==1.2.10
  • huggingface_hub ==0.17.3
  • ipdb ==0.13.13
  • jsonlines ==4.0.0
  • matplotlib ==3.3.3
  • numpy ==1.23.5
  • open3d ==0.15.2
  • open3d_plus ==0.1.0
  • opencv_python ==4.4.0.46
  • pybullet ==3.0.7
  • regex ==2023.10.3
  • scipy ==1.12.0
  • setuptools ==68.0.0
  • supervision ==0.6.0
  • termcolor ==1.1.0
  • timm ==0.9.8
  • tqdm ==4.57.0
  • transformers ==4.34.1
  • typer ==0.9.0
  • yapf ==0.40.1
test_vg/models/GroundingDINO/ops/setup.py pypi
test_vg/requirements.txt pypi
  • addict * test
  • colorlog * test
  • cython * test
  • numpy * test
  • opencv-python * test
  • pycocotools * test
  • pyyaml >3.10 test
  • scipy * test
  • submitit * test
  • supervision ==0.6.0 test
  • termcolor * test
  • timm * test
  • torch * test
  • torchvision * test
  • transformers * test
  • yapf ==0.40.1 test