https://github.com/cv516buaa/ovgnet

Science Score: 23.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
○
.zenodo.json file
○
DOI references
✓
Academic publication links
Links to: arxiv.org
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (7.9%) to scientific vocabulary

Last synced: 10 months ago · JSON representation

Repository

Basic Info

Host: GitHub
Owner: cv516Buaa
Language: Python
Default Branch: main
Size: 25.2 MB

Statistics

Stars: 3
Watchers: 1
Forks: 0
Open Issues: 1
Releases: 0

Created over 2 years ago · Last pushed almost 2 years ago

Metadata Files

Readme

OVGNet: An Unified Visual-Linguistic Framework for Open-Vocabulary Robotic Grasping

Meng Li · Qi Zhao · Shuchang Lyu · Chunlei Wang · Yujing Ma · Guangliang Cheng · Chenguang Yang

Highlight!!!!

This repo is the implementation of "OVGNet: An Unified Visual-Linguistic Framework for Open-Vocabulary Robotic Grasping". we refer to Vision-Language-Grasping, GroundingDINO, VL-Grasp. Many thanks to these excellent repos.

Demo Setting

Novel indicates the unseen objects in training.
Base denotes the seen objects in training.
Battery and power drill are novel classes, which belong to hard task.
Apple and pear are base classes, which belong to simple task.

Demo Video

Demo

Dataset

OVGrasping follows GroundingDINO data format.
The OVGrapsing dataset comprises 117 categories and 63,385 instances.
Instances are sourced from three distinct origins: RoboRefIt, GraspNet, simulated environment.
The dataset is divided into two categories: the base category consists 51,857 instances, and the novel category comprises 11,528 instances.

Installation

Ubantu==18.04
Python==3.9
Torch==1.11, Torchvision==0.12.0
CUDA==11.3
checkpoint==OVGANet
assets==assets

please add the assets into OVGNet folder
please ensure the CUDA version is 11.3 conda create -n OVGNet python=3.9 conda activate OVGNet pip install torch==1.11.0+cu113 torchvision==0.12.0+cu113 torchaudio==0.11.0 --extra-index-url https://download.pytorch.org/whl/cu113

cd /OVGNet/ pip install -r requirments.txt

cd graspnet/graspnet/pointnet2 python setup.py install cd graspnet/graspnet/knn python setup.py install cd groundingdino pip install -e .

Run

cd /OVGNet/ python test.py --testing_case_dir ./test_cases/simple/apple --pretrain ./checkpoint/OVGANet

Test on OVGrasping

cd /OVGNet/test_vg/ python test_vg.py --c ./config/cfg_odvg.py --datasets ./config/datasets_vg_example.json --pretrain_model_path OVGNet/checkpoint/OVGANet

Cite

``` @InProceedings{Li2024IROS, author = {Li Meng and Zhao Qi and Lyu Shuchang and Wang Chunlei and Ma Yujing and Cheng Guangliang and Yang Chenguang}, title = {OVGNet: A Unified Visual-Linguistic Framework for Open-Vocabulary Robotic Grasping}, year = {2024}, eprint = {2407.13175}, archivePrefix = {arXiv}, primaryClass = {cs.RO}, url = {https://arxiv.org/abs/2407.13175}, }

Owner

Name: cv516Buaa
Login: cv516Buaa
Kind: user
Location: Beijing，China
Company: Beihang University

Repositories: 2
Profile: https://github.com/cv516Buaa

Pattern Recognition and Artificial Intelligence Group Prof.Qi Zhao & Lijiang Chen Dr. Shuchang Lyu & Binghao Liu & Chunlei Wang

GitHub Events

Total

Issues event: 2
Watch event: 8
Issue comment event: 2
Fork event: 3

Last Year

Issues event: 2
Watch event: 8
Issue comment event: 2
Fork event: 3

Dependencies

graspnet/graspnet/knn/setup.py pypi

graspnet/graspnet/pointnet2/setup.py pypi

graspnet/graspnet/requirements.txt pypi

Pillow *
numpy *
open3d ==0.8
scipy *
tensorboard ==2.3
torch ==1.6
tqdm *

groundingdino/requirements.txt pypi

addict *
numpy *
opencv-python *
pycocotools *
supervision *
timm *
torch *
torchvision *
transformers *
yapf *

groundingdino/setup.py pypi

requirements.txt pypi

Pillow ==9.2.0
PyYAML ==6.0
Requests ==2.31.0
addict ==2.4.0
fiftyone ==0.23.5
ftfy ==6.1.3
graspnetAPI ==1.2.10
huggingface_hub ==0.17.3
ipdb ==0.13.13
jsonlines ==4.0.0
matplotlib ==3.3.3
numpy ==1.23.5
open3d ==0.15.2
open3d_plus ==0.1.0
opencv_python ==4.4.0.46
pybullet ==3.0.7
regex ==2023.10.3
scipy ==1.12.0
setuptools ==68.0.0
supervision ==0.6.0
termcolor ==1.1.0
timm ==0.9.8
tqdm ==4.57.0
transformers ==4.34.1
typer ==0.9.0
yapf ==0.40.1

test_vg/models/GroundingDINO/ops/setup.py pypi

test_vg/requirements.txt pypi

addict * test
colorlog * test
cython * test
numpy * test
opencv-python * test
pycocotools * test
pyyaml >3.10 test
scipy * test
submitit * test
supervision ==0.6.0 test
termcolor * test
timm * test
torch * test
torchvision * test
transformers * test
yapf ==0.40.1 test

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science