Scientific Software
Updated 6 months ago

resampy — Peer-reviewed • Rank 22.6 • Science 95%

resampy: efficient sample rate conversion in Python - Published in JOSS (2016)

Scientific Software · Peer-reviewed
Updated 6 months ago

Soundata — Peer-reviewed • Rank 17.0 • Science 100%

Soundata: Reproducible use of audio datasets - Published in JOSS (2024)

Updated 6 months ago

MIDI.jl — Peer-reviewed • Rank 10.6 • Science 100%

MIDI.jl: Simple and intuitive handling of MIDI data. - Published in JOSS (2019)

Updated 6 months ago

audiomate — Peer-reviewed • Rank 12.8 • Science 95%

audiomate: A Python package for working with audio datasets - Published in JOSS (2020)

Artificial Intelligence and Machine Learning (40%)
Updated 6 months ago

libfmp — Peer-reviewed • Rank 14.1 • Science 93%

libfmp: A Python Package for Fundamentals of Music Processing - Published in JOSS (2021)

Engineering (40%)
Updated 6 months ago

s(ound)lab — Peer-reviewed • Rank 12.0 • Science 93%

s(ound)lab: An easy to learn Python package for designing and running psychoacoustic experiments. - Published in JOSS (2021)

Neuroscience
Updated 6 months ago

libsoni — Peer-reviewed • Rank 8.1 • Science 95%

libsoni: A Python Toolbox for Sonifying Music Annotations and Feature Representations - Published in JOSS (2024)

Mathematics
Updated 6 months ago

transformers • Rank 38.7 • Science 64%

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Updated 5 months ago

gladia-torchaudio • Rank 30.4 • Science 64%

Data manipulation and transformation for audio signal processing, powered by PyTorch

Updated 6 months ago

librosa • Rank 29.4 • Science 59%

Python library for audio and music analysis

Updated 5 months ago

pyaudioanalysis • Rank 22.2 • Science 59%

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Updated 6 months ago

musicaiz • Rank 9.9 • Science 67%

A Python framework for symbolic music generation, evaluation and analysis

Updated 6 months ago

aac-metrics • Rank 9.6 • Science 67%

Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.

Updated 5 months ago

pliers • Rank 14.0 • Science 54%

Automated feature extraction in Python

Updated 4 months ago

https://github.com/fgnt/padertorch • Rank 13.5 • Science 54%

A collection of common functionality to simplify the design, training and evaluation of machine learning models based on PyTorch, with an emphasis on speech processing.

Updated 6 months ago

dawdreamer • Rank 16.1 • Science 46%

Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, JAX, Warp Markers, and JUCE processors

Updated 4 months ago

https://github.com/fgnt/nara_wpe • Rank 18.7 • Science 41%

Different implementations of "Weighted Prediction Error" for speech dereverberation

Updated 4 months ago

https://github.com/fgnt/paderbox • Rank 15.6 • Science 44%

Paderbox: A collection of utilities for audio / speech processing

Updated 6 months ago

huggingsound • Rank 13.3 • Science 44%

HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools

Updated 5 months ago

https://github.com/csteinmetz1/auraloss • Rank 19.0 • Science 33%

Collection of audio-focused loss functions in PyTorch

Updated 6 months ago

llama_ros • Rank 7.2 • Science 44%

llama.cpp (GGUF LLMs) and llava.cpp (GGUF VLMs) for ROS 2

Updated 5 months ago

python-sounddevice • Rank 24.9 • Science 26%

Play and Record Sound with Python

Updated 5 months ago

audioread • Rank 27.3 • Science 23%

Cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding for Python

Updated 6 months ago

audiotree • Rank 5.4 • Science 44%

Audio data loading and augmentations in JAX

Updated 6 months ago

audio_common • Rank 4.2 • Science 44%

A PortAudio based audio_common with text to speech for ROS 2

Updated 6 months ago

dx7-jax • Rank 2.7 • Science 44%

Yamaha DX7 synthesizer with JAX

Updated 5 months ago

wav2vec • Rank 9.4 • Science 36%

Python package and CLI tool to convert wave files (WAV or AIFF) to vector graphics (SVG, PostScript, CSV)

Updated 6 months ago

audiodeepfakedetection • Rank 5.9 • Science 36%

SUTD 50.039 Deep Learning Course Project (2022 Spring)

Updated 6 months ago

riffusion-hobby • Rank 10.0 • Science 31%

Stable diffusion for real-time music generation

Updated 6 months ago

mosqito • Rank 14.8 • Science 26%

MoSQITo is a unified and modular development framework of key sound quality metrics, favoring reproducible science and efficient shared scripting among engineers, teachers and researchers.

Updated 5 months ago

https://github.com/dbraun/dac-jax • Rank 3.2 • Science 36%

JAX Implementations of Descript Audio Codec and EnCodec

Updated 5 months ago

https://github.com/dbraun/abletonparsing • Rank 9.8 • Science 26%

Parse an Ableton ASD clip file (warp markers and more) in Python

Updated 5 months ago

conformer • Rank 6.1 • Science 23%

Implementation of the convolutional module from the Conformer paper, for use in Transformers

Updated 5 months ago

https://github.com/csteinmetz1/automix-toolkit • Rank 6.0 • Science 23%

Models and datasets for training deep learning automatic mixing models

Updated 6 months ago

macossystemsounds • Rank 2.2 • Science 26%

macOS system sounds in multiple formats (including Base64.)

Updated 5 months ago

response • Rank 12.8 • Science 13%

Your handy frequency and impulse response processing object

Updated 5 months ago

https://github.com/audiolabs/trackswitch.js • Rank 6.6 • Science 13%

A Versatile Web-Based Audio Player for Presenting Scientific Results

Updated 5 months ago

https://github.com/dbraun/td-faust • Rank 5.5 • Science 13%

FAUST (Functional Audio Stream) for TouchDesigner

Updated 6 months ago

lhotse • Science 54%

Tools for handling multimodal data in machine learning projects.

Updated 5 months ago

https://github.com/dbraun/faust-tutorial • Science 26%

Multi-day Faust Tutorial (syntax, plugins, microcontrollers, pedals)

Updated 5 months ago

https://github.com/dbraun/chuckdesigner • Science 13%

ChucK audio integration with TouchDesigner

Updated 6 months ago

stipa • Science 44%

MATLAB implementation of the Speech Transmission Index for Public Address (STIPA) method for evaluating speech transmission quality.

Updated 5 months ago

https://github.com/dbraun/vita • Science 26%

Python bindings to the Vital Synthesizer

Updated 6 months ago

open-in-mpv • Science 44%

Host-side of the extension to open any link or page URL in mpv via the browser context menu.

Updated 6 months ago

roomfuser • Science 54%

Acoustic impulse response generation using diffusion models

Updated 6 months ago

microtonalview • Science 44%

Feel the untold tones—where notation falls short!

Updated 5 months ago

https://github.com/birdnet-team/birdnet-stm32 • Science 26%

Code for training and deployment of a tiny acoustic model for the STM32N6

Updated 6 months ago

2d3mf • Science 36%

Code and models for the paper "2D3MF: Deepfake Detection using Multi Modal Middle Fusion"

Updated 5 months ago

https://github.com/baggepinnen/lazywavfiles.jl • Science 10%

Lazily treat wav (audio) files as arrays. Arrays can be distributed over many wav files.

Updated 6 months ago

beep • Science 26%

Beep is a single, 12-line, 358-character AppleScript that plays a system notification sound for the numeric value of the current hour in quick succession. (Think grandfather clock.)

Updated 5 months ago

https://github.com/aaltoml/nonstationary-audio-gp • Science 10%

End-to-End Probabilistic Inference for Nonstationary Audio Analysis

Updated 6 months ago

quantumaudio • Science 67%

A Python package for building Quantum Representations of Digital Audio. Developed by Moth.

Updated 5 months ago

https://github.com/alexanderlerch/aca-slides • Science 26%

Slides and Code for "An Introduction to Audio Content Analysis," also taught at Georgia Tech as MUSI-6201. This introductory course on Music Information Retrieval is based on the textbook "An Introduction to Audio Content Analysis", Wiley 2012/2022.

Updated 5 months ago

https://github.com/alexisvassquez/ai_spotibot_player • Science 26%

AudioMIX is an open-source, AI-driven music production software designed to empower independent artists and DJs with mood-based audio analysis, LED integration, and creative autonomy. Spotibot was its original name.

Updated 5 months ago

https://github.com/csteinmetz1/ronn • Science 23%

Randomized overdrive neural networks

Updated 5 months ago

https://github.com/bbye98/minim • Science 26%

A collection of music service (iTunes, Qobuz, Spotify, TIDAL) APIs for media information retrieval and semi-automated music tagging.

Updated 5 months ago

https://github.com/csteinmetz1/ai-audio-startups • Science 13%

Community list of startups working with AI in audio and music technology

Updated 6 months ago

radtts-uk • Science 67%

High-fidelity speech synthesis for Ukrainian using modern neural networks.

Updated 6 months ago

laughter-detection-icsi • Science 36%

An ML pipeline for training a laughter detection model on the ICSI corpus

Updated 5 months ago

https://github.com/bkraad47/fat_llama • Science 13%

fat_llama is a Python package for upscaling audio files to FLAC or WAV formats using advanced audio processing techniques. It utilizes CUDA-accelerated calculations to enhance audio quality by upsampling and adding missing frequencies through FFT, resulting in richer and more detailed audio.

Updated 6 months ago

anira • Science 57%

An architecture for neural network inference in real-time audio applications

Updated 6 months ago

nsynth-midi-renderer • Science 26%

Sample based concatenative synthesizer for the NSynth dataset. Render any MIDI (.mid) sequence with the notes of NSynth.

Updated 6 months ago

eeg-cardio-audio-sleep • Science 13%

Project to study sound stimuli that are synchronous, asynchronous or isochronous with the heartbeat during sleep.

Updated 6 months ago

snac • Science 54%

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Updated 6 months ago

seansaudiodb • Science 26%

The personal version of my audio collection database. Not intended for public use. See the other version that is intended for public use: https://github.com/seanpm2001/AudiBass_Manager

Updated 6 months ago

speech-utility-bioacoustics • Science 67%

On the utility of speech and audio foundation models for marmoset call analysis

Updated 5 months ago

https://github.com/bluepixeldev/aftertone • Science 26%

A lightweight Unity plugin for audio pooling. This plugin aims to simplify playing throwaway abrupt sound effects such as gunshot, collision or UI sounds by utilizing a pool of prepared audio sources to reuse and play.

Updated 6 months ago

librosax • Science 44%

Librosa in JAX

Updated 5 months ago

ltfat • Science 31%

Official development repository of the Large Time Frequency Analysis Toolbox

Updated 6 months ago

iossystemsounds • Science 26%

Sounds found in and extracted from System/Library/Audio/UISounds in multiple formats.

Updated 6 months ago

avocodo • Science 26%

A MATLAB-based GUI that optimizes the experience in audio/video coding EEG data.

Updated 5 months ago

https://github.com/neodsp/neolab • Science 26%

neolab is a framework to load, analyze and manipulate audio data