Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ✓ CITATION.cff file: Found CITATION.cff file
- ✓ codemeta.json file: Found codemeta.json file
- ✓ .zenodo.json file: Found .zenodo.json file
- ○ DOI references
- ○ Academic publication links
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: Low similarity (12.9%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: COMP3340-2024-GP11
- License: apache-2.0
- Language: Python
- Default Branch: main
- Size: 13.8 MB
Statistics
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
COMP3340 Group 10 - CLIP + Data Augmentation with Segmentation Image
Contact
- This repository contains code for CLIP zero-shot classification on the Oxford 17 Dataset and data augmentation with segmentation images
- For any questions or enquiries, please feel free to reach out to Zhong Zhiyi (zhongzy@connect.hku.hk)
Overview
Prerequisite for Reproduction
- Set up conda environment
- Download data and put them under the correct folder
- Run the commands to produce the results
Software, Hardware & System Requirements
- Software (set up the environment as follows)
- python==3.8.18
- mmcv==2.1.0
- mmengine==0.10.3
- mmpretrain==1.2.0
- numpy==1.24.4
- opencv-python==4.9.0.80
- Hardware (HKU GPU Farm)
- Experiments are conducted on one NVIDIA GeForce RTX 2080 Ti
- System
- Linux
Environment setup
Basic Setup
Step 1. Create a virtual environment using Anaconda
# Please make sure that you create a virtual env with Python 3.8
conda create -n mmpretrain python=3.8 -y
conda activate mmpretrain
Step 2. Install PyTorch from a wheel
# For those interested, please see https://download.pytorch.org/whl/torch/
wget https://download.pytorch.org/whl/cu111/torch-1.10.1%2Bcu111-cp38-cp38-linux_x86_64.whl#sha256=3d35d58cadb5abbfa25a474a33598a6bdc168c4306c3c20968159e6f3a4a2e46
pip install torch-1.10.1+cu111-cp38-cp38-linux_x86_64.whl
pip install numpy --upgrade
Please double check that you have installed the GPU version of PyTorch using the following commands
```python
import torch
print(torch.__version__)   # the correct output should be '1.10.1+cu111'
torch.cuda.is_available()  # should return True on a GPU compute node
```
Step 3. Install cudatoolkit via the conda-forge channel
# You must be on a GPU compute node to install cudatoolkit and mmcv, since the GCC compiler and CUDA drivers are only available on GPU compute nodes
gpu-interactive
conda activate mmpretrain
conda install cudatoolkit=11.1 -c pytorch -c conda-forge -y
Step 4. Install torchvision and the openmim package using pip, then use openmim to install mmpretrain from source
# make sure you are on GPU compute node
gpu-interactive
# install packages
conda activate mmpretrain
pip install torchvision==0.11.2
git clone https://github.com/COMP3340-2024-GP11/CLIP.git
cd CLIP
pip install -U openmim && mim install -e .
Step 5. An additional step to use CLIP
pip install -q --extra-index-url https://download.pytorch.org/whl/cpu gradio "openvino>=2023.1.0" "transformers[torch]>=4.30" "datasets" "nncf>=2.6.0"
It is safe to ignore an error like "ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts..."
Download and prepare the data
```
# download flowers data
wget https://www.robots.ox.ac.uk/~vgg/data/flowers/17/17flowers.tgz
tar zxvf 17flowers.tgz
mv 17flowers data/flower
cd data

# copy split.py and generate_meta.py to the flower directory to split the data
cp split.py flower
cp generate_meta.py flower
cd flower
python split.py
mkdir meta
python generate_meta.py
cd ..
```
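For reference, split.py and generate_meta.py ship with this repository and their contents are not shown here. As a purely illustrative sketch (assuming the standard 17flowers layout of 1360 images named image_0001.jpg to image_1360.jpg, with 80 consecutive images per class; the real scripts may differ), a meta file of "filename label" pairs could be generated like this:

```python
# Illustrative sketch only: the repository's split.py / generate_meta.py
# may use a different split strategy or meta-file format.

def label_for_image(index, images_per_class=80):
    """0-based class label for a 1-based image index in the 17flowers
    layout (80 consecutive images per class)."""
    return (index - 1) // images_per_class

def make_meta_lines(num_images=1360):
    """Return 'filename label' lines, one per image."""
    return ["image_{:04d}.jpg {}".format(i, label_for_image(i))
            for i in range(1, num_images + 1)]

lines = make_meta_lines()
print(lines[0])    # image_0001.jpg 0
print(lines[-1])   # image_1360.jpg 16
```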
Commands to reproduce results
Part A: Zero-shot Classification with CLIP
First, please make sure you are in the main directory CLIP instead of CLIP/data
Then, you can run the program clip.py directly:
python clip.py
The program prints the test accuracy of CLIP for each class from 0 to 16, followed by the overall accuracy (testing every class may take 4 to 5 minutes):
class 0: 1.0
class 1: 0.9875
...
Alternatively, you can open clip.ipynb in any IDE that supports Python notebooks to see a visualization demo of CLIP inference with and without prompts. To view the inference result for a different image, just change the file name, following the file-name format specified in the notebook.
Note: results may vary slightly between runs
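A quick note on how the per-class numbers combine into the overall figure: with equal test-set sizes per class, the overall accuracy is simply the mean of the per-class accuracies; otherwise it should be weighted by class counts. A minimal sketch (the accuracy values below are placeholders, not outputs of clip.py):

```python
# Combine per-class accuracies into an overall accuracy.
# The accuracy values here are placeholders, not actual results.

def overall_accuracy(per_class_acc, per_class_counts):
    """Weight each class accuracy by its number of test images."""
    correct = sum(a * n for a, n in zip(per_class_acc, per_class_counts))
    total = sum(per_class_counts)
    return correct / total

acc = [1.0, 0.9875, 0.95]   # placeholder per-class accuracies
counts = [80, 80, 80]       # equal class sizes -> same as the plain mean
print(round(overall_accuracy(acc, counts), 4))  # 0.9792
```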
Part B: Data augmentation using segmentation image
First, download the ground-truth segmentation images into the data folder
```
# download segmentation image data; please make sure you are in the main directory
cd data
wget https://www.robots.ox.ac.uk/~vgg/data/flowers/17/trimaps.tgz
tar zxvf trimaps.tgz
```
Then, you can use generate_segment.py to augment the data with segmentation masks and add perturbations to the background:
python generate_segment.py
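The idea described above (keep the flower, perturb the background using the trimap) can be sketched in NumPy. This is an illustrative sketch, not the repository's actual generate_segment.py; the function name and the Gaussian-noise model are assumptions:

```python
import numpy as np

def perturb_background(image, trimap_fg_mask, noise_std=25.0, seed=0):
    """Add Gaussian noise to background pixels only, keeping the
    flower (foreground) pixels from the trimap untouched.

    image:          HxWx3 uint8 array
    trimap_fg_mask: HxW boolean array, True where the flower is
    """
    rng = np.random.default_rng(seed)
    noise = rng.normal(0.0, noise_std, size=image.shape)
    out = image.astype(np.float64)
    out[~trimap_fg_mask] += noise[~trimap_fg_mask]  # background pixels only
    return np.clip(out, 0, 255).astype(np.uint8)

# tiny synthetic example: 4x4 image, left half marked as "foreground"
img = np.full((4, 4, 3), 128, dtype=np.uint8)
fg = np.zeros((4, 4), dtype=bool)
fg[:, :2] = True
aug = perturb_background(img, fg)
```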
Split the data again
```
cp split.py flowers
cp generate_meta.py flowers
cd flowers
python split.py
mkdir meta
python generate_meta.py
cd ..
```
After this step, all images in the folder 'flowers' have been augmented and can be used for training
cd .. #make sure you are in the main directory
python tools/train.py \
configs/resnet/resnet18_flowers_bs128.py \
--work-dir output/resnet18_flowers_bs128
After training, you can use the following command to test the model (as I have modified configs/base/datasets/flowers.py, testing will still be done on the original data)
python tools/test.py configs/resnet/resnet18_flowers_bs128.py output/resnet18_flowers_bs128/epoch_200.pth --out output/resnet18_flowers_bs128/test.pkl
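The --out flag dumps the raw test results as a pickle file. The exact structure of test.pkl depends on the mmpretrain version, so the 'pred_label'/'gt_label' keys below are an assumption; the round-trip itself just uses the standard pickle module:

```python
import os
import pickle
import tempfile

# Sketch: reading back a result file written with pickle, as tools/test.py
# does with --out. The per-sample dict keys ('pred_label', 'gt_label') are
# assumptions; inspect the real object to confirm its structure. Here we
# round-trip a placeholder object to show the mechanics.
placeholder = [{"pred_label": 3, "gt_label": 3},
               {"pred_label": 7, "gt_label": 5}]

path = os.path.join(tempfile.mkdtemp(), "test.pkl")
with open(path, "wb") as f:
    pickle.dump(placeholder, f)

with open(path, "rb") as f:
    results = pickle.load(f)

correct = sum(r["pred_label"] == r["gt_label"] for r in results)
print(correct / len(results))  # 0.5
```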
After this, everything is done. Thanks for your patience!
Owner
- Name: COMP3340 Group 10
- Login: COMP3340-2024-GP11
- Kind: organization
- Repositories: 1
- Profile: https://github.com/COMP3340-2024-GP11
HKU COMP3340 Group 10 (2023-24 Second Semester)
Citation (CITATION.cff)
```
cff-version: 1.2.0
message: "If you use this software, please cite it as below."
title: "OpenMMLab's Pre-training Toolbox and Benchmark"
authors:
  - name: "MMPreTrain Contributors"
version: 0.15.0
date-released: 2023-04-06
repository-code: "https://github.com/open-mmlab/mmpretrain"
license: Apache-2.0
```
Dependencies
- actions/checkout v3 composite
- actions/setup-python v4 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- codecov/codecov-action v1.0.14 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build
- pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build
- pytorch/torchserve latest-gpu build
- docutils ==0.18.1
- modelindex *
- myst-parser *
- pytorch_sphinx_theme *
- sphinx ==6.1.3
- sphinx-copybutton *
- sphinx-notfound-page *
- sphinx-tabs *
- sphinxcontrib-jquery *
- tabulate *
- mmcv >=2.0.0,<2.4.0
- mmengine >=0.8.3,<1.0.0
- pycocotools *
- transformers >=4.28.0
- albumentations >=0.3.2
- grad-cam >=1.3.7,<1.5.0
- requests *
- scikit-learn *
- mmcv-lite >=2.0.0rc4
- mmengine *
- pycocotools *
- torch *
- torchvision *
- transformers *
- einops *
- importlib-metadata *
- mat4py *
- matplotlib *
- modelindex *
- numpy *
- rich *
- coverage * test
- interrogate * test
- pytest * test