salmon-scan
A dataset of salmon fish disease monitoring and control
Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 2 DOI reference(s) in README -
✓Academic publication links
Links to: sciencedirect.com -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.1%) to scientific vocabulary
Repository
A dataset of salmon fish disease monitoring and control
Basic Info
- Host: GitHub
- Owner: shoaibmehrab
- Default Branch: master
- Size: 44.9 KB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 2
Metadata Files
README.md
Welcome to the SalmonScan Dataset
The SalmonScan dataset is a collection of images of salmon fish, including healthy fish and infected fish. The dataset consists of two classes of images:
- Fresh salmon 🐟
- Infected Salmon 🐠
This dataset is ideal for various computer vision tasks in machine learning and deep learning applications. Whether you are a researcher, developer, or student, the SalmonScan dataset offers a rich and diverse data source to support your projects and experiments.
So, dive in and explore the fascinating world of salmon health and disease!
Data Overview
The SalmonScan dataset consists of approximately 1,208 images of salmon fish, classified into two classes:
- Fresh salmon (healthy fish with no visible signs of disease), 456 images
- Infected Salmon containing disease, 752 images
Each class contains a representative and diverse collection of images, capturing a range of different perspectives, scales, and lighting conditions. The images have been carefully curated to ensure that they are of high quality and suitable for use in a variety of computer vision tasks.
The dataset is stored in PNG format and is currently without a split into training, validation, and test sets. However, a typical split would be 60% training, 20% validation, and 20% test, giving the following counts:
| Split | Fresh Fish 🐟 | Infected Fish 🐠 | Total | |------------|---------------|------------------|---------| | Training | 273 | 451 | 724 | | Validation | 92 | 151 | 243 | | Test | 91 | 150 | 241 | | Total | 456 | 752 | 1208|
Whether you are interested in developing new computer vision algorithms, testing existing models, or simply learning about salmon health and disease, the SalmonScan dataset provides an excellent resource for your needs.
Data Preprocessing
The input images were preprocessed to enhance their quality and suitability for further analysis. The following steps were taken:
Resizing 📏: All the images were resized to a uniform size of 600 pixels in width and 250 pixels in height to ensure compatibility with the learning algorithm.
Image Augmentation 📸: To overcome the small amount of images, various image augmentation techniques were applied to the input images. These included:
- Horizontal Flip ↩️: The images were horizontally flipped to create additional samples.
- Vertical Flip ⬆️: The images were vertically flipped to create additional samples.
- Rotation 🔄: The images were rotated to create additional samples.
- Cropping 🪓: A portion of the image was randomly cropped to create additional samples.
- Gaussian Noise 🌌: Gaussian noise was added to the images to create additional samples.
- Shearing 🌆: The images were sheared to create additional samples.
- Contrast Adjustment (Gamma) ⚖️: The gamma correction was applied to the images to adjust their contrast.
- Contrast Adjustment (Sigmoid) ⚖️: The sigmoid function was applied to the images to adjust their contrast.
These preprocessing steps were performed to increase the size of the dataset and make it more diverse and representative, which can improve the performance of the learning algorithm.
Usage
To use the salmon scan dataset in your ML and DL projects, follow these steps:
- Clone or download the salmon scan dataset repository from GitHub.
- Unzip the file to access the two folders (FreshFish and InfectedFish).
- Load the images into your preferred programming environment, such as Python.
- Use standard libraries such as
numpyorpandasto convert the images into arrays, which can be input into a machine learning or deep learning model. - Split the dataset into training, validation, and test sets as per your requirement.
- Preprocess the data as needed, such as resizing and normalizing the images.
- Train your ML/DL model using the preprocessed training data.
- Evaluate the model on the test set and make predictions on new, unseen data.
Dataset Download
You can download the dataset using the following link:
Note: The download button is located in the top left corner after clicking the link.
License
The salmon scan dataset is released under the MIT License, which is an open source license that permits use, modification, and distribution of the software. The full license text can be found in the LICENSE file.
Citation
If you use this dataset in your research or projects, please cite the following references:
``` @article{ahmed2024salmonscan, title={SalmonScan: A Novel Image Dataset for Machine Learning and Deep Learning Analysis in Fish Disease Detection in Aquaculture}, author={Ahmed, Md Shoaib and Jeba, Samiha Maisha}, journal={Data in Brief}, pages={110388}, year={2024}, publisher={Elsevier} }
```
@article{SHOAIBAHMED2021,
title = {Fish Disease Detection Using Image Based Machine Learning Technique in Aquaculture},
journal = {Journal of King Saud University - Computer and Information Sciences},
year = {2021},
issn = {1319-1578},
doi = {https://doi.org/10.1016/j.jksuci.2021.05.003},
url = {https://www.sciencedirect.com/science/article/pii/S1319157821001063},
author = {Md {Shoaib Ahmed} and Tanjim {Taharat Aurpa} and Md. {Abul Kalam Azad}},
keywords = {Fish Diseases, Aquaculture, Image Processing, Machine Learning, Support Vector Machine, Salmon Fish}
}
Owner
- Name: Md Shoaib Ahmed
- Login: shoaibmehrab
- Kind: user
- Location: Dhaka
- Company: S3 Innovate Pte. Ltd
- Twitter: shoaibmehrab
- Repositories: 1
- Profile: https://github.com/shoaibmehrab
Software Engineer at Brain Station 23 Ltd. | Full-Stack web developer | Former MLSA @microsoft | Founder and Former President @JU-Computer-Club
Citation (CITATION.cff)
@article{SHOAIBAHMED2021,
title = {Fish Disease Detection Using Image Based Machine Learning Technique in Aquaculture},
journal = {Journal of King Saud University - Computer and Information Sciences},
year = {2021},
issn = {1319-1578},
doi = {https://doi.org/10.1016/j.jksuci.2021.05.003},
url = {https://www.sciencedirect.com/science/article/pii/S1319157821001063},
author = {Md {Shoaib Ahmed} and Tanjim {Taharat Aurpa} and Md. {Abul Kalam Azad}},
keywords = {Fish Diseases, Aquaculture, Image Processing, Machine Learning, Support Vector Machine, Salmon Fish}
}
GitHub Events
Total
- Fork event: 1
Last Year
- Fork event: 1