webvision1.0
Process WebVision 1.0 data into an ImageNet-style directory structure for the PyTorch dataloader.
Repository
Basic Info
- Host: GitHub
- Owner: sangamesh-kodge
- License: apache-2.0
- Language: Shell
- Default Branch: master
- Homepage: https://www.linkedin.com/in/sangamesh-kodge/
- Size: 8.79 KB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
Readme.md
WebVision 1.0
This project preprocesses the WebVision 1.0 dataset and arranges it in the same directory structure as the ImageNet-1k dataset.
Dataset details
To check the image counts, run `find train/ -maxdepth 2 -type f | wc -l` in a terminal (use `val/` for the validation split).
- Number of train images - 2345405
- Number of val images - 50000 (50 per class)
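The same check can be sketched in Python for environments without `find`. This is a minimal illustration, not part of the repository: the function name `count_images` is hypothetical, and unlike the `find` command it counts only files that sit exactly one class folder below the split directory, ignoring stray files directly under it.

```python
from pathlib import Path


def count_images(split_dir):
    """Count regular files located exactly one class folder below split_dir."""
    split = Path(split_dir)
    return sum(
        1
        for class_dir in split.iterdir()
        if class_dir.is_dir()          # e.g. train/nxxxxxxxx
        for item in class_dir.iterdir()
        if item.is_file()              # e.g. train/nxxxxxxxx/some_image.jpg
    )


if __name__ == "__main__":
    # Assumes the script is run from the WebVision1.0 root after processing.
    for name in ("train", "val"):
        if Path(name).is_dir():
            print(name, count_images(name))
```

If processing succeeded, the printed numbers should match the counts listed above.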
Below is the final directory structure:
WebVision1.0
├── train
│   ├── nxxxxxxxx
│   ├── nxxxxxxxx
│   └── ...
├── val
│   ├── nxxxxxxxx
│   ├── nxxxxxxxx
│   └── ...
├── info
│   ├── xxxx
│   ├── xxx
│   └── ...
├── create_WebVision_as_ImageNet.sh
├── helper.py
├── Readme.md
├── Citation.cff
└── LICENSE
Note: Similar processing can be done for the Flickr partition by modifying `create_WebVision_as_ImageNet.sh` and `helper.py`.
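Folder-per-class loaders such as torchvision's `ImageFolder` derive integer labels from the sorted names of the class subfolders, so `train` and `val` should contain the same set of `nxxxxxxxx` folders. The following is a rough sketch of that check using only the standard library; the function names `class_to_index` and `splits_are_consistent` are hypothetical and not part of this repository.

```python
from pathlib import Path


def class_to_index(split_dir):
    """Map each class folder name to an index, assigned in sorted order."""
    classes = sorted(p.name for p in Path(split_dir).iterdir() if p.is_dir())
    return {name: idx for idx, name in enumerate(classes)}


def splits_are_consistent(train_dir, val_dir):
    """Return True when both splits contain exactly the same class folders."""
    return class_to_index(train_dir) == class_to_index(val_dir)
```

Calling `splits_are_consistent("train", "val")` from the `WebVision1.0` root should return `True` for a correctly processed dataset; after that, each split directory can be passed as the `root` of `torchvision.datasets.ImageFolder`.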
Instructions
- Clone this repository
- Navigate to the root of this project
- Run the following command in your terminal: `sh create_WebVision_as_ImageNet.sh`
Expected Terminal logs
```
Creating directory structure similar to ImageNet for training dataset
Creating directory structure similar to ImageNet for val dataset
Removing Redundant files.
WebVision Dataset 1.0 Processed!
```
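The "Creating directory structure" steps in the log can be pictured with the sketch below. It is an illustration under stated assumptions, not the code in `helper.py`: it assumes a file list whose lines read `<relative image path> <class index>` and a synset list whose lines start with the `nxxxxxxxx` identifier, ordered by class index. The function name `reorganize` and its parameters are hypothetical, and file-name clashes within a class are not handled.

```python
import shutil
from pathlib import Path


def reorganize(file_list, synsets, src_root, dst_root):
    """Move each listed image into dst_root/<synset id>/ and return the count.

    file_list: iterable of lines "<relative image path> <class index>" (assumed format)
    synsets:   iterable of lines whose first token is a synset id,
               ordered by class index (assumed format)
    """
    wnids = [line.split()[0] for line in synsets if line.strip()]
    src_root, dst_root = Path(src_root), Path(dst_root)
    moved = 0
    for line in file_list:
        if not line.strip():
            continue  # skip blank lines
        rel_path, class_idx = line.rsplit(maxsplit=1)
        class_dir = dst_root / wnids[int(class_idx)]
        class_dir.mkdir(parents=True, exist_ok=True)
        src = src_root / rel_path
        shutil.move(str(src), str(class_dir / src.name))
        moved += 1
    return moved
```

Mapping the class index to a synset folder name is what gives the output the ImageNet-style `split/nxxxxxxxx/image` layout shown earlier.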
Conclusion
This project preprocesses the WebVision 1.0 dataset into a directory structure similar to ImageNet. The script automates the whole process, so the result can be read by the same folder-per-class PyTorch data-loading code used for ImageNet.
Citation
Kindly cite the repository if you use the code. Thanks!
APA
Kodge, S. (2024). WebVision1.0 [Computer software]. https://github.com/sangamesh-kodge/WebVision1.0
BibTeX
@software{Kodge_WebVision1_0_2024,
author = {Kodge, Sangamesh},
month = feb,
title = {{WebVision1.0}},
url = {https://github.com/sangamesh-kodge/WebVision1.0},
year = {2024}
}
Owner
- Login: sangamesh-kodge
- Kind: user
- Repositories: 1
- Profile: https://github.com/sangamesh-kodge
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Kodge"
  given-names: "Sangamesh"
  orcid: "https://orcid.org/0000-0001-9713-5400"
title: "WebVision1.0"
date-released: 2024-2-7
url: "https://github.com/sangamesh-kodge/WebVision1.0"