https://github.com/amir22010/large-scale-vrd

Visual Relationship Detection

Science Score: 10.0%

This score indicates how likely this project is to be science-related based on various indicators:

○ CITATION.cff file
○ codemeta.json file
○ .zenodo.json file
○ DOI references
✓ Academic publication links (links to: arxiv.org)
○ Academic email domains
○ Institutional organization owner
○ JOSS paper metadata
○ Scientific vocabulary similarity: low similarity (16.1%) to scientific vocabulary

Last synced: 9 months ago

Repository

Visual Relationship Detection

Basic Info

Host: GitHub
Owner: Amir22010
License: other
Language: Python
Default Branch: master
Size: 9.59 MB

Statistics

Stars: 0
Watchers: 2
Forks: 0
Open Issues: 0
Releases: 0

Fork of facebookresearch/Large-Scale-VRD

Created about 7 years ago · Last pushed about 7 years ago

https://github.com/Amir22010/Large-Scale-VRD/blob/master/

# Large-scale Visual Relationship Understanding

![alt text](https://github.com/facebookresearch/Large-Scale-VRD/blob/master/Examples.PNG)

Example results from the VG80K dataset.

This is the Caffe2 implementation for [Large-scale Visual Relationship Understanding, AAAI2019](https://arxiv.org/abs/1804.10660). This code is for the VG80K dataset only. For results on VG200 and VRD please refer to the [PyTorch implementation](https://github.com/jz462/Large-Scale-VRD.pytorch).

**Note:** In this repo we use ground-truth boxes during testing, so there is no object detection module involved in this repo.

## Caffe2

To install Caffe2 with CUDA support, follow the [installation instructions](https://caffe2.ai/docs/getting-started.html) from the [Caffe2 website](https://caffe2.ai/). **If you already have Caffe2 installed, make sure to update your Caffe2 to a version that includes the [Detectron module](https://github.com/pytorch/pytorch/tree/master/modules/detectron).**

Please ensure that your Caffe2 installation was successful before proceeding by running the following commands and checking their output as directed in the comments.

```
# To check if Caffe2 build was successful
python2 -c 'from caffe2.python import core' 2>/dev/null && echo "Success" || echo "Failure"

# To check if Caffe2 GPU build was successful
# This must print a number > 0 in order to use Detectron
python2 -c 'from caffe2.python import workspace; print(workspace.NumCudaDevices())'
```

If the `caffe2` Python package is not found, you likely need to adjust your `PYTHONPATH` environment variable to include its location (`/path/to/caffe2/build`, where `build` is the Caffe2 CMake build directory).
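The two Caffe2 checks above can also be run from Python. The following is a minimal sketch and not part of this repository: the helper name `check_caffe2` is hypothetical, and it relies only on the `caffe2.python` imports and the `workspace.NumCudaDevices()` call already used in the commands above.

```python
from __future__ import print_function  # keeps print() behaving the same on Python 2


def check_caffe2():
    """Report whether Caffe2 imports and how many CUDA devices it sees.

    Hypothetical helper for illustration; not provided by the repository.
    """
    try:
        # Same imports as the shell one-liners in the Caffe2 section.
        from caffe2.python import core  # noqa: F401
        from caffe2.python import workspace
    except ImportError:
        # Caffe2 is not on PYTHONPATH (or is not installed).
        return {"importable": False, "num_gpus": 0}
    return {"importable": True, "num_gpus": int(workspace.NumCudaDevices())}


if __name__ == "__main__":
    status = check_caffe2()
    print("Caffe2 importable:", status["importable"])
    print("CUDA devices:", status["num_gpus"])
    if status["importable"] and status["num_gpus"] == 0:
        print("No CUDA device is visible to Caffe2; a count above 0 is required.")
```

If the import fails, adjusting `PYTHONPATH` as described above is the first thing to try.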
## Other Dependencies

Install Python dependencies (the version specifiers are quoted so the shell does not treat `>` as a redirect):

```
pip install "numpy>=1.13" "pyyaml>=3.12" matplotlib "opencv-python>=3.2" setuptools Cython mock scipy
```

## Large-scale-VRD

Clone the Large-scale-VRD repository:

```
# LARGE_SCALE_VRD=/path/to/clone/Large-scale-VRD
git clone https://github.com/fairinternal/VRD $LARGE_SCALE_VRD
```

Set up Python modules:

```
cd $LARGE_SCALE_VRD/lib && make
```

## Annotations

Download VG annotation files from [here](https://www.dropbox.com/s/minpyv59crdifk9/datasets.zip). Put the zip file under `$LARGE_SCALE_VRD` and unzip it. You should see a `datasets` folder unzipped there.

## Datasets

Download VG80K images from [here](http://visualgenome.org/api/v0/api_home.html). Unzip all images into `$LARGE_SCALE_VRD/datasets/large_scale_VRD/Visual_Genome/images`.

## Pretrained Embedding Models

Download pretrained embeddings from [here](https://www.dropbox.com/s/r6uh5n9h76k41w7/Ji%20Zhang%20-%20embeddings.zip). Put the zip file under `$LARGE_SCALE_VRD/datasets/large_scale_VRD` and unzip it. You should see a `label_embeddings` folder and a `models` folder there.

## Our Trained Models

You can download our trained models from [here](https://www.dropbox.com/s/t5b1b2odn781035/checkpoints.zip). Put the zip file under `$LARGE_SCALE_VRD` and unzip it. You should see a `checkpoints` folder unzipped there.

## Training

To train VG80K with 8 GPUs, run:

```
python tools/train_net_rel.py --cfg configs/vg/VG_wiki_and_relco_VGG16_softmaxed_triplet_2_lan_layers_8gpu.yaml
```

## Testing

To test VG80K with 8 GPUs, run:

```
python tools/test_net_rel.py --cfg configs/vg/VG_wiki_and_relco_VGG16_softmaxed_triplet_2_lan_layers_8gpu.yaml
```

## License

This project is licensed under the license found in the LICENSE file in the root directory of this source tree. Our revised annotations, linked above, are based on Visual Genome, which is licensed under the [Creative Commons Attribution 4.0 International Public License](https://creativecommons.org/licenses/by/4.0/).
Our revised annotations are under the Attribution-NonCommercial 4.0 International License, which can be found in the LICENSE file in the root directory of this source tree.

## Citing Large-Scale-VRD

If you use this code in your research, please use the following BibTeX entry.

```
@conference{zhang2018large,
  title={Large-Scale Visual Relationship Understanding},
  author={Zhang, Ji and Kalantidis, Yannis and Rohrbach, Marcus and Paluri, Manohar and Elgammal, Ahmed and Elhoseiny, Mohamed},
  booktitle={AAAI},
  year={2019}
}
```
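Before training or testing, it can help to confirm that the folders named in the download steps above actually exist. The following is a minimal sketch under stated assumptions: the function `missing_paths` and its `root` argument (the directory the repository was cloned into) are hypothetical names, and the folder paths are copied from the setup sections of this README.

```python
from __future__ import print_function  # keeps print() behaving the same on Python 2

import os
import sys

# Folders the setup sections say should exist, relative to the clone root.
# "checkpoints" is only present if the released trained models were downloaded.
EXPECTED_DIRS = [
    "datasets",
    os.path.join("datasets", "large_scale_VRD", "Visual_Genome", "images"),
    os.path.join("datasets", "large_scale_VRD", "label_embeddings"),
    os.path.join("datasets", "large_scale_VRD", "models"),
    "checkpoints",
]


def missing_paths(root):
    """Return the expected folders that do not exist under `root`."""
    return [d for d in EXPECTED_DIRS if not os.path.isdir(os.path.join(root, d))]


if __name__ == "__main__":
    # Hypothetical usage: pass the clone directory as the first argument.
    root = sys.argv[1] if len(sys.argv) > 1 else "."
    missing = missing_paths(root)
    for d in missing:
        print("missing:", d)
    if not missing:
        print("All expected folders found.")
```

An empty result only means the folders exist; it does not verify that the archives were fully unzipped.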

Owner

Name: Amir Khan
Login: Amir22010
Kind: user
Location: India

Repositories: 3
Profile: https://github.com/Amir22010

Working on developing state-of-the-art AI solutions, mainly in the computer vision, chatbot, and NLP domains. Building awesome AI as a professional developer 😍.
