iseparate-sdx

iSeparate library for the SDX2023 challenge

https://github.com/naba89/iseparate-sdx

Science Score: 54.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: arxiv.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (3.3%) to scientific vocabulary
Last synced: 10 months ago · JSON representation ·

Repository

iSeparate library for the SDX2023 challenge

Basic Info
  • Host: GitHub
  • Owner: naba89
  • License: mit
  • Language: Python
  • Default Branch: main
  • Size: 718 KB
Statistics
  • Stars: 13
  • Watchers: 2
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created over 3 years ago · Last pushed over 2 years ago
Metadata Files
Readme License Citation

README.md

iSeparate-SDX

This library contains the code to train and reproduce the results for the submissions by username: subatomicseer for the SDX 2023 Challenge

Author: Nabarun Goswami

Affiliation: Harada-Osa-Mukuta-Kurose Lab, The University of Tokyo

Submission and Results Summary

  • MDX Leaderboard A (labelnoise)

    | SDRsong | SDRbass | SDRdrums | SDRother | SDR_vocals | |:----------:|:----------:|:----------:|:-----------:|:------------:| | 6.601 | 6.696 | 7.026 | 4.611 | 8.072 |

  • MDX Leaderboard B (bleeding)

    | SDRsong | SDRbass | SDRdrums | SDRother | SDR_vocals | |:--------:| :------: | :-------: | :-------: | :--------: | | 6.314 | 6.331 | 6.864 | 4.591 | 7.469 |

  • MDX Leaderboard C (Open)

    | SDRsong | SDRbass | SDRdrums | SDRother | SDR_vocals | |:--------:| :------: | :-------: | :-------: | :--------: | | 8.537 | 9.328 | 9.328 | 6.182 | 9.311 |

  • CDX Leaderboard A and B

    • Submission repo: CDX2023-dnr-submission
    • Submission ID: 220293
    • Submitter: subatomicseer
    • Final rank: 3rd place (A), 6th place (B)
    • Final scores:

    | SDRmean | SDRdialog | SDReffect | SDRmusic | |:--------:| :------: | :-------: | :-------: | | 4.144 |7.178 |3.466 |2.011 |

Model Descriptions

Throughout the challenge, we used the following models: - DWT-Transformer-UNet: DWT-Transformer-UNet.md - Wavelet-HTDemucs: Wavelet-HTDemucs.md - BSRNN: https://arxiv.org/abs/2209.15174

Noise Robust Training for MDX Leaderboard A and B

The description of noise robust training losses is provided in: - Noise Robust Training

Data Augmentations

The description of data augmentations is provided in: - Data Augmentations

Environment setup

shell conda create -n sdx2023 python=3.8 conda activate sdx2023 conda install -c conda-forge ffmpeg pip install -r requirements.txt

Training instructions for individual models are provided in the docs folder:

Note regarding the datasets All datasets are assumed to be in a directory named DATASETS in the root directory of this project as shown below: - DATASETS/SDX2023/moisesdb23_bleeding_v1.0 - DATASETS/SDX2023/moisesdb23_labelnoise_v1.0 - DATASETS/DnR/dnr_v2 - DATASETS/MUSDB-HQ

Either copy the datasets to the above locations, or create symbolic links to the datasets, or you can change the dataset paths in the config files and pre-processing scripts.

MDX track

Leaderboard A (labelnoise): docs/TRAINING(MDX-Labelnoise).md

Leaderboard B (bleeding): docs/TRAINING(MDX-Bleeding).md

Leaderboard C (Open): docs/TRAINING(MDX-Open).md

CDX track

Leaderboard A and B: docs/TRAINING(CDX-DnR).md

References

[1] S. Rouard, et al., "Hybrid Transformers for Music Source Separation", Arxiv 2022

[2] Y. Luo, et al., "Music Source Separation with Band-split RNN", Arxiv 2022

[3] S. Uhlich, et al., "Improving music source separation based on deep neural networks through data augmentation and network blending", ICASSP 2017.

[4] S. Wisdom, et al., "Unsupervised Sound Separation Using Mixture Invariant Training", NeurIPS 2020

[5] A. Tarvainen, et al., "Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results", NIPS 2017

[6] T. Ishida, et al., "Do We Need Zero Training Loss After Achieving Zero Training Error?", ICML 2020

[7] T. Nakamura, et al., "Time-Domain Audio Source Separation Based on Wave-U-Net Combined with Discrete Wavelet Transform", ICASSP 2020

Owner

  • Name: Nabarun Goswami
  • Login: naba89
  • Kind: user
  • Location: Tokyo
  • Company: @mil-tokyo

PhD Student @mil-tokyo @media-comp @nablas-inc

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Goswami"
  given-names: "Nabarun"
  affiliation: "Harada-Osa-Mukuta-Kurose Lab, The University of Tokyo"
  email: "nabarungoswami@mi.t.u-tokyo.ac.jp"
title: "iSeparate-SDX"
version: 1
date-released: 2023-05-17
url: "https://github.com/naba89/iSeparate-SDX"

GitHub Events

Total
Last Year