webvision1.0

Process WebVision1.0 Data to have ImageNet Directory structure for pytorch dataloader.

https://github.com/sangamesh-kodge/webvision1.0

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (8.8%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

Process WebVision1.0 Data to have ImageNet Directory structure for pytorch dataloader.

Basic Info
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created about 2 years ago · Last pushed about 2 years ago
Metadata Files
Readme License Citation

Readme.md

WebVision 1.0

This project preprocess the WebVision 1.0 Dataset and gives a directory structure ImageNet1k dataset.

Dataset details

Check using find train/ -maxdepth 2 -type f | wc -l in terminal

  • Number of train images - 2345405
  • Number of val images - 50000 (50 per class)

Below is the final directory structure:

└── WebVision1.0 ├── train | ├── nxxxxxxxx | ├── nxxxxxxxx | └── ... | └── val | ├── nxxxxxxxx | ├── nxxxxxxxx | └── ... | └── info | ├── xxxx | ├── xxx | └── ... | └── create_WebVision_as_ImageNet.sh └── helper.py └── Readme.md └── Citation.cff └── LICENSE

Note Similar Processing can be done for the Flicker partition. Modify the createWebVisionas_ImageNet.sh and helper.py.

Instructions

  1. Clone this repository
  2. Navigate to the root of this project
  3. Run the following command in your terminal/command prompt:

    bash sh create_WebVision_as_ImageNet.sh

Expected Terminal logs

```

Creating directory structure similar to ImageNet for training dataset


Creating directory structure similar to ImageNet for val dataset


Removing Redundant files.


WebVision Dataset 1.0 Processed!

```

Conclusion

The project preprocess WebVision 1.0 Dataset and gives a directory structure similar to ImageNet. The script automates the preprocessing and provides a directory structure for WebVision similar to ImageNet.

Citation

Kindly cite the repository if you use the code. Thanks!

APA

Kodge, S. (2024). WebVision1.0 [Computer software]. https://github.com/sangamesh-kodge/WebVision1.0

Bibtex

@software{Kodge_WebVision1_0_2024, author = {Kodge, Sangamesh}, month = feb, title = {{WebVision1.0}}, url = {https://github.com/sangamesh-kodge/WebVision1.0}, year = {2024} }

Owner

  • Login: sangamesh-kodge
  • Kind: user

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Kodge"
  given-names: "Sangamesh"
  orcid: "https://orcid.org/0000-0001-9713-5400"
title: "WebVision1.0"
date-released: 2024-2-7
url: "https://github.com/sangamesh-kodge/WebVision1.0"

GitHub Events

Total
Last Year