ssl-acoustic

SSL Model for Underwater Acoustic Data

https://github.com/ahmetpala/ssl-acoustic

Science Score: 57.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
✓
DOI references
Found 5 DOI reference(s) in README
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (12.9%) to scientific vocabulary


Repository

SSL Model for Underwater Acoustic Data

Basic Info

Host: GitHub
Owner: ahmetpala
License: apache-2.0
Language: Python
Default Branch: main
Size: 44.3 MB

Statistics

Stars: 2
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 0

Created over 1 year ago · Last pushed 10 months ago

Metadata Files

Readme License Citation

Self-supervised feature learning for acoustic data analysis

This repository contains the implementation of self-supervised learning methods for acoustic data analysis, focusing on fisheries echosounder data. The primary goal of this study was to develop a deep learning model inspired by the DINO architecture to extract acoustic features without requiring manual annotations. The model was trained using multiple data sampling strategies to address class imbalance and improve the discriminative power of features in downstream tasks such as classification and regression.

Main model overview

This repository implements a Self-Supervised Learning (SSL) model designed specifically for acoustic data analysis. The training process involves the following steps:

Data preparation: Extract acoustic data patches from the training dataset using a predefined sampling scheme to form the training set.
View generation: For each patch x, generate two global views and eight local views, forming the sets V_G and V_L, respectively. Views are then chosen in pairs from the combined set V = V_G ∪ V_L for training.
Network assignment:
- The teacher network receives only global views (V_G).
- The student network can receive both global and local views (V).
Training mechanism: The teacher and student networks align their outputs by minimizing the dissimilarity between the chosen views.
Parameter updates:
- Student network parameters (θ_s) are updated using the AdamW optimizer.
- Teacher network parameters (θ_t) are updated using an exponential moving average (EMA) of the student parameters.
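
The steps above can be illustrated with a minimal, dependency-free sketch. It is not the code in this repository: the temperatures, the momentum value, the centering of the teacher output, and the rule that a view is never paired with itself are assumptions borrowed from the original DINO recipe, and all function names are hypothetical.

```python
import math

def log_softmax(logits, temperature):
    """Numerically stable log-softmax of temperature-scaled logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    log_norm = peak + math.log(sum(math.exp(s - peak) for s in scaled))
    return [s - log_norm for s in scaled]

def view_pairs(n_global=2, n_local=8):
    """Index pairs (teacher_view, student_view); indices below n_global are global views.

    Assumption: a view is never paired with itself, as in DINO.
    """
    n_views = n_global + n_local
    return [(t, s) for t in range(n_global) for s in range(n_views) if s != t]

def cross_view_loss(student_logits, teacher_logits, center,
                    student_temp=0.1, teacher_temp=0.04):
    """Cross-entropy between the centered, sharpened teacher output and the student output."""
    centered = [z - c for z, c in zip(teacher_logits, center)]
    teacher_probs = [math.exp(lp) for lp in log_softmax(centered, teacher_temp)]
    student_log_probs = log_softmax(student_logits, student_temp)
    return -sum(p * lp for p, lp in zip(teacher_probs, student_log_probs))

def ema_update(teacher_params, student_params, momentum=0.996):
    """theta_t <- m * theta_t + (1 - m) * theta_s, applied element-wise."""
    return [momentum * t + (1.0 - momentum) * s
            for t, s in zip(teacher_params, student_params)]

# Toy usage: the teacher only ever sees the two global views.
pairs = view_pairs()
teacher = ema_update([1.0, 1.0], [0.0, 2.0])
```

In an actual training loop the loss would be averaged over all pairs, the student would take an AdamW step on it, and only then would the teacher be refreshed with the EMA update.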

Figure: Overview of the self-supervised learning model applied in the study.

Repository structure

batch/: Contains submodules for organizing batch processing and data transformation tasks. Key subfolders include:
- data_augmentation/: Houses scripts that perform various data augmentation techniques to improve model generalization.
- data_transforms/: Contains scripts for performing transformations on input data to prepare it for training.
- label_transforms/: Scripts focused on transforming and handling label data appropriately.
- samplers/: Contains modules for defining custom sampling strategies used during training.
data/: Includes Python modules responsible for reading and managing datasets. This folder contains utility scripts used for handling data from external sources or preprocessed files:
- data_reader.py: Script for reading and parsing raw data files.
- dataloaders.py: Contains functions to create data loaders for efficient batch processing during training.
- partition.py: Helps split datasets based on a specified partitioning scheme.
Jupyter_Notebooks/: Contains Jupyter notebooks for demonstrating and documenting different steps in the training, testing, and analysis processes. Key notebooks include:
- Training_Data_Creation/: Contains the notebook for generating training datasets and conducting experiments.
- Final_Tests.ipynb: Evaluates model performance and provides visual insights into the results.
- EchogramPaintingCode.ipynb: Notebook for visualizing echograms with labeled regions or features of interest.
- UNET_Predictions_Reading.ipynb: A notebook to read and evaluate the predictions made by the UNet model.
utils_unet/: Contains utility scripts and helper functions that support model development and data processing tasks.
Modules outside the main folders:
- Labeling_Survey_Patches.py: Assigns labels to survey patches for creating labeled datasets.
- main_dino_acoustic_FixedData.py: Implements the DINO-inspired SSL model and manages the main training loop.
- modifiedresnet*.py: Defines customized ResNet architectures for extracting acoustic features with varying output configurations.
- extract_features_acoustic_FixedData.py: Extracts features from acoustic datasets using the trained self-supervised model for downstream tasks.
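
To make the patch-labeling idea concrete, here is a small sketch of tiling a 2-D grid into non-overlapping patches and recording each patch's center coordinates and mean value. It does not reproduce the repository's labeling script: the function name, the record keys, and the plain arithmetic mean over dB-like values are illustrative assumptions.

```python
def patch_metadata(echogram, patch_size):
    """Tile a 2-D grid into non-overlapping square patches.

    Returns one record per full patch with its center coordinates and mean value.
    Partial patches at the right and bottom edges are dropped.
    """
    n_rows = len(echogram)
    n_cols = len(echogram[0]) if n_rows else 0
    records = []
    for top in range(0, n_rows - patch_size + 1, patch_size):
        for left in range(0, n_cols - patch_size + 1, patch_size):
            values = [echogram[r][c]
                      for r in range(top, top + patch_size)
                      for c in range(left, left + patch_size)]
            records.append({
                "center_row": top + patch_size // 2,
                "center_col": left + patch_size // 2,
                "mean_sv": sum(values) / len(values),
            })
    return records

# Toy 4 x 6 grid of Sv-like values with one strong sample in the first patch.
toy = [[-70.0] * 6 for _ in range(4)]
toy[0][0] = -30.0
records = patch_metadata(toy, patch_size=2)
```

Records of this kind could then be filtered or weighted by their mean value when choosing which patches to extract for training.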

Prerequisites

numpy~=1.26.2
matplotlib~=3.8.1
xarray~=0.20.0
pandas~=1.2.3
scipy~=1.11.3
torch~=2.0.1
PyYAML~=6.0.1
Pillow~=10.0.1
torchvision~=0.16.1
tqdm~=4.66.1
joblib~=1.3.2
scikit-learn~=1.3.2
zarr~=2.6.1
numcodecs~=0.12.1
opencv-python~=4.8.1.78
requests~=2.28.1
scikit-image~=0.22.0
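
As a convenience, the pins above can be checked against the current environment with a short standard-library sketch. The helper names are hypothetical, only a subset of the pins is listed, and the compatibility test is a simplified reading of the `~=` operator that handles purely numeric versions.

```python
import re
from importlib import metadata

PINS = ["numpy~=1.26.2", "torch~=2.0.1", "zarr~=2.6.1"]  # subset of the list above

def parse_pin(pin):
    """Split 'name~=1.2.3' into ('name', (1, 2, 3))."""
    name, version = pin.split("~=")
    return name.strip(), numeric_version(version.strip())

def numeric_version(text):
    """Leading numeric components of a version string, e.g. '2.0.1+cu117' -> (2, 0, 1)."""
    parts = []
    for piece in text.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    return tuple(parts)

def is_compatible(installed, pinned):
    """Simplified '~=' check: same prefix as the pin and not older than it."""
    prefix = pinned[:-1]
    return installed[:len(prefix)] == prefix and installed >= pinned

def check(pins):
    """Map each package name to 'ok', 'not installed', or a mismatch message."""
    report = {}
    for pin in pins:
        name, pinned = parse_pin(pin)
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            report[name] = "not installed"
            continue
        ok = is_compatible(numeric_version(installed), pinned)
        report[name] = "ok" if ok else f"installed {installed}, expected {pin}"
    return report
```

Calling `check(PINS)` returns a dictionary that can be printed before starting a long training run.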

Usage

Labeling non-overlapping patches: Use the Labeling_Survey_Patches.py script to assign labels to the patches created from each survey's data. This prepares the metadata containing the center coordinates and average Sv values used for intensity-based sampling and training tasks.
Training data creation: Run the Jupyter notebook in Training_Data_Creation/ to identify the center coordinates and survey year of the patches selected by the intensity-based sampling scheme.
Extracting training data patches: Use the script Producing_Training_Data_Python_IntensityBased2.py to extract data patches based on the specified criteria, preparing them for model training.
Training SSL model: Train the model using main_dino_acoustic_FixedData.py. This script runs the training process and saves the trained model.
Feature extraction:
- Produce data patches: For a specific survey year, generate test patches by running scripts such as Producing_Annotation_Patches_Test_Data_2017_Python.py.
- Extract and save features: Run extract_features_acoustic_FixedData.py to extract features from the patches using the trained model.
Model evaluation:
- Test results and visualizations: Run Final_Tests.ipynb to assess model performance and visualize key results.
- Evaluate raw data: Use Testing_Pixel_Based_KNN.ipynb to analyze the model's results on raw acoustic data.
- UNET results: Run UNET_Predictions_Reading.ipynb to view and analyze predictions from the UNET model.

Note: Before running the scripts, update the data directory paths in each script or notebook to match your local file structure.
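
The evaluation step applies the extracted features to downstream tasks. As a rough illustration of how such features can be scored with a k-nearest-neighbour classifier, consider the sketch below. It is not taken from the notebooks: the function name, the toy 2-D features, and the class names are hypothetical.

```python
import math
from collections import Counter

def knn_predict(train_features, train_labels, query, k=3):
    """Label a query feature vector by majority vote among its k nearest neighbours."""
    distances = sorted(
        (math.dist(feature, query), label)
        for feature, label in zip(train_features, train_labels)
    )
    votes = Counter(label for _, label in distances[:k])
    return votes.most_common(1)[0][0]

# Toy features standing in for embeddings produced by the trained model.
features = [[0.0, 0.0], [0.0, 1.0], [5.0, 5.0], [5.0, 6.0]]
labels = ["background", "background", "fish", "fish"]
prediction = knn_predict(features, labels, [0.2, 0.1], k=3)
```

Because the classifier has no trainable parameters, its accuracy reflects the quality of the learned features rather than that of a separately trained head.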

Data availability

The data used in this study are securely stored on servers managed by the Institute of Marine Research (IMR). Due to their extensive size (as detailed in the Supplementary Material), access can be granted via an Amazon S3 access token upon request, in accordance with IMR's data-sharing policies. Detailed information about the dataset's size, structure, and metadata is included in the Supplementary Material to further facilitate reproducibility.

Citation

If you use this repository or build upon this work, please cite the following paper:

Pala, A., Oleynik, A., Handegard, N.O. (2024). Self-supervised feature learning for acoustic data analysis. Ecological Informatics, 80, 102533. https://doi.org/10.1016/j.ecoinf.2024.102533

BibTeX

@article{pala2024sslacoustic,
  title = {Self-supervised feature learning for acoustic data analysis},
  author = {Pala, Ahmet and Oleynik, Anna and Handegard, Nils Olav},
  journal = {Ecological Informatics},
  volume = {80},
  pages = {102533},
  year = {2024},
  publisher = {Elsevier},
  doi = {10.1016/j.ecoinf.2024.102533}
}

Contact

For questions or collaboration inquiries, please reach out to the corresponding author, Ahmet Pala.

Owner

Login: ahmetpala
Kind: user

Repositories: 2
Profile: https://github.com/ahmetpala

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite the following paper."
title: "Self-supervised feature learning for acoustic data analysis"
version: "1.0"
url: "https://github.com/ahmetpala/SSL-Acoustic"
authors:
  - family-names: Pala
    given-names: Ahmet
  - family-names: Oleynik
    given-names: Anna
  - family-names: Handegard
    given-names: Nils Olav

preferred-citation:
  type: article
  title: "Self-supervised feature learning for acoustic data analysis"
  authors:
    - family-names: Pala
      given-names: Ahmet
    - family-names: Oleynik
      given-names: Anna
    - family-names: Handegard
      given-names: Nils Olav
  doi: "10.1016/j.ecoinf.2024.102878"
  url: "https://www.sciencedirect.com/science/article/pii/S1574954124004205"
  date-published: 2024-12-01
