https://github.com/denajgibbon/gibbon-feature-comparison

https://github.com/denajgibbon/gibbon-feature-comparison

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 1 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (7.4%) to scientific vocabulary
Last synced: 4 months ago · JSON representation

Repository

Basic Info
  • Host: GitHub
  • Owner: DenaJGibbon
  • Language: R
  • Default Branch: master
  • Size: 120 MB
Statistics
  • Stars: 0
  • Watchers: 2
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created over 3 years ago · Last pushed over 2 years ago

https://github.com/DenaJGibbon/Gibbon-feature-comparison/blob/master/



# Overview

This is the repository for: Mel-frequency cepstral coefficients
outperform embeddings from pre-trained convolutional neural networks
under noisy conditions for discrimination tasks of individual gibbons
(Lakdari et al, under review). The goal of the paper is to compare
different approaches of feature extraction for individual discrimination
of gibbon female calls.

Feature extraction was done in both R and Python, and analyses for
publication were done in R.

# Data availability

Acoustic data can be downloaded at: 10.5281/zenodo.8205685.

## Metadata for all data included in the repository.

FileLocation Description Date DateType Summary
data/features A folder containing the different feature sets for each .wav file, along with recorder ID that include location, time, and date. There is also a column for individual ID Fri Sep 15 08:49:34 2023 Folder The folder contains .csv files for acoustic indices, BirdNET, MFCCs, VGGIsh, and Wav2Vec2. For VGGIsh and BirdNET the .csv files are divided by recorder location.
data/MB Playbacks 50 m.csv A .csv file containing the GPS coordinates of the recorders Fri Sep 15 08:49:34 2023 .csv This file contains the GPS coordinates of each recorder M01-M09
data/randomization_affinity Contains a .csv files for each feature type Fri Sep 15 08:49:34 2023 Folder This file contains the classification accuracy, recorder, number of clusters returned by affinity propagation clustering, and normalized mutual information value
data/randomization_hdbscan Contains a .csv files for each feature type Fri Sep 15 08:49:34 2023 Folder This file contains the classification accuracy, recorder, number of clusters returned by hdbscan, and normalized mutual information value
data/snr_df Contains a .csv files for each recorder location Fri Sep 15 08:49:34 2023 Folder This file contains the recording ID, signal-to-noise ratio, recorder, and wave file path.
# Feature/embedding extraction on the audio data ### MFCCs MFCCs are calculated using the Processing features for randomization.R R script. ### BirdNET Follow the installation instructions here: . Then use the BirdNET Terminal Script. Then run the Processing features for randomization.R R script to convert BirdNET embeddings into the format needed for analyses. ### VGGish Follow installation instructions: . Then use the VGGish Terminal Script. Then run the Processing features for randomization.R R script to convert VGGish embeddings into the format needed for analyses. ### Wav2Vec2 Wav2Vec2 embeddings are caluclated using the Wav2Vec2_Features.py Python script. ### Acoustic indices Acoustic indices are calculated using the Processing features for randomization.R R script. # SNR calculation on audio data SNR calculation is done on sound clips that have an extra 2-s on either side of the call using the SNR Calculation R script. # Supervised classification and unsupervised clustering of processed data Use the Randomization for playbacks.R script to randomly divide data for each feature and distance category using a 80/20 split. This script uses the processed data for each feature located in the data/features/features folder. # Creating plots for publication See Script to recreate figures.R to recreate all figures in publication.
Figure 1. Uniform Manifold Approximation and Projections (UMAP) of female gibbon calls recorded ~ 50 m away from the playback speaker for each feature type.

Owner

  • Name: Dena J. Clink
  • Login: DenaJGibbon
  • Kind: user
  • Company: K. Lisa Yang Center for Conservation Bioacoustics

I am a biological anthropologist, bioacoustician, and avid R user. I use innovative bioacoustics techniques to answer evolutionary questions.

GitHub Events

Total
  • Watch event: 1
Last Year
  • Watch event: 1