iv-q4-birdclef-2024

Team Epoch IV's solution to the BirdCLEF 2024 competition. Includes the award-winning working note🏆

https://github.com/teamepochgithub/iv-q4-birdclef-2024

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (2.0%) to scientific vocabulary

Last synced: 11 months ago · JSON representation ·

Repository

Team Epoch IV's solution to the BirdCLEF 2024 competition. Includes the award-winning working note🏆

Basic Info

Host: GitHub
Owner: TeamEpochGithub
License: mit
Language: Jupyter Notebook
Default Branch: main
Homepage:
Size: 58.9 MB

Statistics

Stars: 4
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 0

Created about 2 years ago · Last pushed almost 2 years ago

Metadata Files

Readme License Citation

BirdCLEF 2024 - Best Working Note 🏆

This is Team Epoch IV's solution to the BirdCLEF 2024 competition, placed 247/992.

The award-winning working note, Addressing the Challenges of Domain Shift in Bird Call Classification for BirdCLEF 2024, is included in this repository at ./docs/working_note/.

Reproducing Our Best Submission

This section contains the steps that need to be taken to reproduce our best submission on the private leaderboard.

Prerequisites

Models were trained on machines with the following specifications:

CPU: AMD Ryzen Threadripper Pro 3945WX 12-Core Processor / AMD Ryzen 9 7950X 16-Core Processor
GPU: NVIDIA RTX A5000 / NVIDIA RTX Quadro 6000 / NVIDIA RTX A6000
RAM: 96GB / 128GB
OS: Ubuntu 23.10 / Arch Linux 2024.04.10
Python: 3.10.13

Estimated training time: 1-3 hours per model on these machines.

For running inference, a machine with at least 32GB of RAM is recommended.

0. Clone the repository

Make sure to clone the repository with your favourite git client or using the following command:

shell git clone https://github.com/TeamEpochGithub/iv-q4-detect-bird.git

1. Set up the environment

First, clone the repository and navigate to the project directory. Make sure Rye is installed on your machine and run:

bash rye sync

Alternatively, you can manually install Python 3.10.13, set up a virtual environment, and install the dependencies from requirements-dev.lock using pip:

bash pip install -r requirements-dev.lock

2. Set up the training & testing data

Download the competition data here or use the following command:

bash kaggle competitions download -c birdclef-2024

Then extract birdclef-2024.zip to data/raw/2024/.

Place the sounds you want to make predictions on in data/raw/2024/test_soundscapes/.

3. Train the model

train.py is used to train a model. train.py reads a configuration file from conf/train.yaml. This configuration file contains the model configuration to train with additional training parameters such as test_size and a scorer to use. The model selected in the conf/train.yaml can be found in the conf/model/ folder where a whole model configuration is stored, from preprocessing to postprocessing. When training is finished, the model is saved in tm/ with a hash that depends on the specific preprocessing & pretraining steps, and model configuration.

You can skip this step if you only want to run inference on the test data with the model from our best submission, as we already included this model in this repository as tm/cfd080d568b9341fa9b02decb0e59ae1_0.pt. If you wish to retrain this model, ensure that the model in conf/train.yaml is set to playful-monkey-752 and run train.py.

4. Run inference

submit.py runs inference on the test data from the competition given a trained model or an ensemble of trained models. It reads a configuration file from conf/submit.yaml which contains the model/ensemble configuration to use for inference. Model configs can be found in conf/model/ and ensemble configs in conf/ensemble. conf/ensemble specifies the models (from conf/model) to use for the ensemble and the weights to use for each model.

Ensure that the model in conf/submit.yaml is set to playful-monkey-752 and run submit.py to generate the submission file submission/submission.csv.

pre-commit

This repository uses pre-commit with Ruff and MyPy hooks for code quality checks and auto-formatting. To install the pre-commit hooks, run:

bash rye run pre-commit install

To run the pre-commit checks on all files, run:

bash rye run pre-commit run --all-files

Contributors

This repository was created by Team Epoch IV, based in the Dream Hall of the Delft University of Technology.

Owner

Name: Team Epoch
Login: TeamEpochGithub
Kind: organization
Email: info@teamepoch.net

Website: https://www.teamepoch.net/
Repositories: 1
Profile: https://github.com/TeamEpochGithub

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Lim"
  given-names: "Jeffrey"
  affiliation: "TU Delft Dream Team Epoch"
  email: "Jeffrey-Lim@outlook.com"
- family-names: "Witting"
  given-names: "Emiel"
  affiliation: "TU Delft Dream Team Epoch"
  email: "emiel.witting@gmail.com"
- family-names: "Heer"
  name-particle: "de"
  given-names: "Hugo"
  affiliation: "TU Delft Dream Team Epoch"
  email: "hugodeheer1234@gmail.com"
- family-names: "Kopar"
  given-names: "Cahit Tolga"
  affiliation: "TU Delft Dream Team Epoch"
  email: "cahittolgakopar@gmail.com"
- family-names: "Sándor"
  given-names: "Kristóf"
  affiliation: "TU Delft Dream Team Epoch"
  email: "Emherk512@gmail.com"
title: "Identifying Bird Calls"
version: 1.0.0
date-released: 2024-06-12
url: "https://github.com/TeamEpochGithub/iv-q4-birdclef-2024"

GitHub Events

Total

Watch event: 1

Last Year

Watch event: 1

Dependencies

kaggle_requirements.txt pypi

audiomentations ==0.35.0
epochalyst ==0.3.2
hydra-core ==1.3.2
onnxruntime ==1.17.3

pyproject.toml pypi

annotated-types >=0.6.0
coloredlogs >=15.0.1
dask >=2024.4.0
dask-expr >=1.0.12
epochalyst ==0.3.2
hydra-core >=1.3.2
kornia >=0.7.2
librosa >=0.10.1
matplotlib >=3.8.2
numpy >=1.26.2
onnx >=1.16.0
onnxruntime >=1.17.3
pandas >=2.1.4
plotly >=5.18.0
randomname >=0.2.1
scikit-learn >=1.4.2
scipy >=1.12.0
seaborn >=0.13.0
sparsemax >=0.1.9
timm >=0.9.16
torch ==2.2.2+cu121
torchaudio ==2.2.2+cu121
torchvision ==0.17.2+cu121
tqdm >=4.60.0
typing_extensions >=4.9.0
wandb >=0.16.2

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science