https://github.com/ccomkhj/cvops

Simplify COCO dataset management with an intuitive GUI for visualization, merging, splitting, and updating. Perfect for ML and computer vision projects.

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

○ CITATION.cff file
✓ codemeta.json file: found
✓ .zenodo.json file: found
○ DOI references
○ Academic publication links
○ Academic email domains
○ Institutional organization owner
○ JOSS paper metadata
○ Scientific vocabulary similarity: low similarity (13.7%) to scientific vocabulary

Keywords

annotation-tool coco computer-vision cv labeling-tool pycocotools

Last synced: 5 months ago

Repository

Basic Info

Host: GitHub
Owner: ccomkhj
Language: Python
Default Branch: main
Homepage:
Size: 3.6 MB

Statistics

Stars: 5
Watchers: 2
Forks: 3
Open Issues: 2
Releases: 0

Topics

annotation-tool coco computer-vision cv labeling-tool pycocotools

Created about 3 years ago · Last pushed 10 months ago

Metadata Files

Readme

cvOps

cvOps provides a graphical user interface (GUI), built with PyQt5, for managing COCO datasets. It is aimed at researchers and developers working on machine learning and computer vision, and covers the common dataset tasks: visualizing, merging, splitting, updating, and post-updating COCO datasets.

Getting Started

Install:

    pip install -e .
    pip install git+https://github.com/ccomkhj/COCO-Assistant.git@master

Verify that all tests pass:

    pytest

Launch: Start the COCO Tools GUI by running the main window script:

    python3 cvops/main_window.py

2. Usage:

The main interface presents options for visualizing, merging, splitting, updating, and post-updating datasets.

- Visualize: Visualize a COCO dataset.
- Remap Categories: Remap the order of the categories.
- Merge: To merge datasets, specify the image directory and the annotation directory of the datasets to be combined, and indicate whether the image sets should also be merged. Each dataset's directory name (e.g. sample1) must match the name of its COCO file (e.g. sample1.json):

      .
      ├── images_dir
      │   ├── sample1
      │   ├── sample2
      │   ├── sample3
      │
      ├── anns_dir
      │   ├── sample1.json
      │   ├── sample2.json
      │   ├── sample3.json

- Separate by Name: Separate a dataset into multiple subsets based on parts of the image file names. This requires the path to the image files, the path to the COCO annotation file, and a list of name keys. Images matching each name key are grouped together, and a new COCO annotation file is created for each subset.
- Split: Provide an annotation file and an image directory, and specify the split ratio, i.e. the share of the data allocated to the training set.
- Update (Local): Runs Split and Merge in one shot. Select the new annotations, the existing training and validation annotation files, and the location of the new images. The new sample is split first, and each part is then merged with the corresponding existing set (train with train, val with val).
- Update (S3): Same as Update (Local), but the files are stored in AWS S3. Tip: copy paths with the Copy S3 URI button in the web interface. Requires config/s3_credentials.yaml to be configured.
- Post Update: Choose the directories of the new and existing samples to run the post-update operations, which organize the dataset into the standard structure. After checking the quality, you can optionally upload the updated project directly to S3. Note: Post Update is still required even if you ran Update (S3).
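The grouping behind Separate by Name can be sketched in a few lines of plain Python. This is only an illustration of the idea, not the cvOps implementation: the function name `separate_by_name` is hypothetical, and it assumes that "matching" means the name key occurs as a substring of the image's `file_name`.

```python
def separate_by_name(coco, name_keys):
    """Group a COCO dict into subsets, one per name key.

    Assumption: an image belongs to a subset when the key is a
    substring of its file_name. cvOps may match differently.
    """
    subsets = {}
    for key in name_keys:
        # Keep the images whose file name contains the key.
        images = [img for img in coco["images"] if key in img["file_name"]]
        image_ids = {img["id"] for img in images}
        # Keep only the annotations that belong to those images.
        annotations = [
            ann for ann in coco["annotations"] if ann["image_id"] in image_ids
        ]
        subsets[key] = {
            "images": images,
            "annotations": annotations,
            "categories": coco["categories"],  # categories are shared by all subsets
        }
    return subsets


# Tiny made-up dataset, for illustration only.
coco = {
    "images": [
        {"id": 1, "file_name": "farmA_001.jpg"},
        {"id": 2, "file_name": "farmB_001.jpg"},
        {"id": 3, "file_name": "farmA_002.jpg"},
    ],
    "annotations": [
        {"id": 10, "image_id": 1, "category_id": 1},
        {"id": 11, "image_id": 2, "category_id": 1},
        {"id": 12, "image_id": 3, "category_id": 1},
    ],
    "categories": [{"id": 1, "name": "plant"}],
}
subsets = separate_by_name(coco, ["farmA", "farmB"])
```

Each value in `subsets` is itself a COCO-style dict, so it could be written to its own annotation file with `json.dump`.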

Data Structure: After using cvOps, all data follows the structure below. The layout is intended to work with other computer vision projects (SAHI, MMDET).

After performing the post-update process, the dataset will adhere to a standardized directory structure:

    /
    |-train_images/
    |-val_images/
    |-train.json
    |-val.json

This structure keeps training and validation images in separate folders (train_images, val_images), with the corresponding annotation files (train.json, val.json) in the root dataset directory. Having one fixed layout makes the dataset easy to locate and to feed into model training.
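A quick way to confirm that a directory follows this layout is to check for the four expected entries. The helper below is a minimal sketch and not part of cvOps; the name `missing_entries` is hypothetical.

```python
from pathlib import Path

# Entries of the standard layout described above.
EXPECTED_DIRS = ("train_images", "val_images")
EXPECTED_FILES = ("train.json", "val.json")


def missing_entries(dataset_root):
    """Return the expected directories and files absent under dataset_root."""
    root = Path(dataset_root)
    # Directories are reported with a trailing slash to tell them apart from files.
    missing = [f"{name}/" for name in EXPECTED_DIRS if not (root / name).is_dir()]
    missing += [name for name in EXPECTED_FILES if not (root / name).is_file()]
    return missing
```

An empty list means the directory matches the layout; otherwise the list names what is still missing.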

Only once the dataset follows this structure can you use Update (S3), which updates the project directly in an AWS S3 bucket. This requires config/s3_credentials.yaml:

    aws_secret_access_key: {aws_secret_access_key}
    aws_access_key_id: {aws_access_key_id}
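Before running an S3 operation, it can help to confirm that the placeholders in the credentials file were actually replaced. The sketch below is an assumption-laden illustration rather than cvOps code: `check_s3_credentials` is a hypothetical helper that takes the mapping you would get from loading the file, for example with PyYAML's `yaml.safe_load`.

```python
REQUIRED_KEYS = ("aws_access_key_id", "aws_secret_access_key")


def check_s3_credentials(creds):
    """Return a list of problems found in a loaded credentials mapping."""
    problems = []
    for key in REQUIRED_KEYS:
        value = creds.get(key)
        if not value:
            problems.append(f"{key} is missing or empty")
        elif not isinstance(value, str) or (
            value.startswith("{") and value.endswith("}")
        ):
            # An unquoted {name} in YAML loads as a mapping, and a quoted one
            # as a brace-wrapped string; both mean the template was not filled in.
            problems.append(f"{key} looks like an unfilled placeholder")
    return problems
```

An empty result means both keys are present and look like real values; the check says nothing about whether AWS will accept them.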

Contributing

We welcome and encourage contributions! Fork the repository and submit pull requests to propose new features or enhancements to the cvOps GUI tool, aiming to improve its utility in managing COCO datasets more efficiently and effectively.

Owner

Name: Huijo
Login: ccomkhj
Kind: user
Location: Germany
Company: @hexafarms

Website: https://ccomkhj.github.io/
Repositories: 3
Profile: https://github.com/ccomkhj

Self Learner

GitHub Events

Total

Push event: 2
Fork event: 1

Last Year

Push event: 2
Fork event: 1

Issues and Pull Requests

Last synced: 10 months ago

All Time

Total issues: 2
Total pull requests: 2
Average time to close issues: N/A
Average time to close pull requests: 5 days
Total issue authors: 1
Total pull request authors: 1
Average comments per issue: 0.0
Average comments per pull request: 0.0
Merged pull requests: 2
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 1
Pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Issue authors: 1
Pull request authors: 0
Average comments per issue: 0.0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

Top Authors

Issue Authors

ccomkhj (2)

Pull Request Authors

ymincoding (2)

Top Labels

Issue Labels

enhancement (1)

Pull Request Labels

Dependencies

requirements/git-requirements.txt pypi

requirements/requirements.txt pypi

PyQt5 *
fiftyone *
funcy *
loguru *
pytest *
pytest-mock *
pyyaml *
rich *
scikit-learn *
scikit-multilearn *
typer *

setup.py pypi

line.strip *

requirements/extra-requirements.txt pypi

fiftyone *
