Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.3%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: sangamesh-kodge
- License: apache-2.0
- Language: Shell
- Default Branch: master
- Size: 7.81 KB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
Readme.md
Clothing1M
This project preprocess the Clothing 1M Dataset and gives a classwise directory structure. The raw dataset can be obtained by following steps on official repository
Dataset details
Check using find test/ -maxdepth 2 -type f | wc -l in terminal
- Number of nosiy train images - 1000000
- Number of clean train images - 47570
- Number of val images - 10526
- Number of test images - 14313
Below is the final directory structure:
The images in the directroy for each partition of the dataset are arranged in the directory with q00xx where xx is the class number from 0-13.
└── Clothing1M
├── clean_train
| ├── q0000
| ├── q0001
| └── ...
| └── q0013
|
├── noisy_train
| ├── q0000
| ├── q0001
| └── ...
| └── q0013
|
├── val
| ├── q0000
| ├── q0001
| └── ...
| └── q0013
|
├── test
| ├── q0000
| ├── q0001
| └── ...
| └── q0013
|
├── xxxx
├── xxxx
├── xxxx
├── xxxx
├── xxxx
└── creat_dataset.sh
└── helper.py
└── Readme.md
└── Citation.cff
└── LICENSE
Instructions
- Get access to dataset following steps on official repository
- Clone this repository.
bash git clone https://github.com/sangamesh-kodge/Clothing1M.git - Download the dataset from the download link obtained in step 1 in the cloned repository from step 2. (Or move data after download. )
- Unzip files in
images/directory usingtar -xf <file_name.tar>. Additionally unzip file in the root directory usingunzip <file_name.zip>. (See the commented lines 20-29 in create_dataset.sh). At this stage the cloned repository should haveimages/directory containing raw images and files with the label information for each image. Run the following command in your terminal/command contraining the cloned repository
bash sh create_dataset.sh
Conclusion
The project preprocess Clothing1M Dataset and gives a classwise directory structure.
Citation
Kindly cite the repository if you use the code. Thanks!
APA
Kodge, S. (2024). Clothing1M [Computer software]. https://github.com/sangamesh-kodge/Clothing1M
Bibtex
@software{Kodge_Clothing1M_2024,
author = {Kodge, Sangamesh},
month = feb,
title = {{Clothing1M}},
url = {https://github.com/sangamesh-kodge/Clothing1M},
year = {2024}
}
Owner
- Login: sangamesh-kodge
- Kind: user
- Repositories: 1
- Profile: https://github.com/sangamesh-kodge
Citation (CITATION.cff)
cff-version: 1.2.0 message: "If you use this software, please cite it as below." authors: - family-names: "Kodge" given-names: "Sangamesh" orcid: "https://orcid.org/0000-0001-9713-5400" title: "Clothing1M" date-released: 2024-2-29 url: "https://github.com/sangamesh-kodge/Clothing1M"