Projects | Open Source Science

Scientific Software

Updated 10 months ago

resampy — Peer-reviewed • Rank 22.6 • Science 95%

resampy: efficient sample rate conversion in Python - Published in JOSS (2016)

audio dsp nyucds python

Scientific Software · Peer-reviewed

Scientific Software

Updated 10 months ago

Soundata — Peer-reviewed • Rank 17.0 • Science 100%

Soundata: Reproducible use of audio datasets - Published in JOSS (2024)

audio bioacoustics dataset environmental-sound python urban-sound

Scientific Software · Peer-reviewed

Scientific Software

Updated 10 months ago

Spafe — Peer-reviewed • Rank 17.9 • Science 93%

Spafe: Simplified python audio features extraction - Published in JOSS (2023)

audio audio-analysis beat dsp features-extraction filterbank frequencies frequency frequency-analysis gammatone-filterbanks mfcc music music-information-retrieval pitch python signal-processing sound speech-processing time-frequency-analysis voice

Mathematics

Scientific Software · Peer-reviewed

Scientific Software

Updated 10 months ago

MIDI.jl — Peer-reviewed • Rank 10.6 • Science 100%

MIDI.jl: Simple and intuitive handling of MIDI data. - Published in JOSS (2019)

audio interface julia midi notes

Scientific Software · Peer-reviewed

Scientific Software

Updated 10 months ago

audiomate — Peer-reviewed • Rank 12.8 • Science 95%

audiomate: A Python package for working with audio datasets - Published in JOSS (2020)

audio audio-datasets corpus-tools data-loader dataset-creation dataset-filtering dataset-manager music noise speech speech-recognition

Artificial Intelligence and Machine Learning (40%)

Scientific Software · Peer-reviewed

Scientific Software

Updated 10 months ago

libfmp — Peer-reviewed • Rank 14.1 • Science 93%

libfmp: A Python Package for Fundamentals of Music Processing - Published in JOSS (2021)

audio beat chord dtw f0 fourier hmm mir music nmf onset processing python retrieval synchronization tempo

Engineering (40%)

Scientific Software · Peer-reviewed

Scientific Software

Updated 10 months ago

s(ound)lab — Peer-reviewed • Rank 12.0 • Science 93%

s(ound)lab: An easy to learn Python package for designing and running psychoacoustic experiments. - Published in JOSS (2021)

audio psychoacoustics sound

Neuroscience

Scientific Software · Peer-reviewed

Scientific Software

Updated 10 months ago

libsoni — Peer-reviewed • Rank 8.1 • Science 95%

libsoni: A Python Toolbox for Sonifying Music Annotations and Feature Representations - Published in JOSS (2024)

audio mir sonification

Mathematics

Scientific Software · Peer-reviewed

Updated 10 months ago

transformers • Rank 38.7 • Science 64%

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

audio deep-learning deepseek gemma glm hacktoberfest llm machine-learning model-hub natural-language-processing nlp pretrained-models python pytorch pytorch-transformers qwen speech-recognition transformer vlm

Updated 10 months ago

gladia-torchaudio • Rank 30.4 • Science 64%

Data manipulation and transformation for audio signal processing, powered by PyTorch

audio audio-processing io machine-learning python pytorch speech

Updated 10 months ago

librosa • Rank 29.4 • Science 59%

Python library for audio and music analysis

audio dsp librosa music python scipy

Updated 10 months ago

pyaudioanalysis • Rank 22.2 • Science 59%

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

audio audio-analysis-tasks audio-data machine-learning pyaudioanalysis python signal-processing

Updated 10 months ago

muspy • Rank 15.1 • Science 64%

A toolkit for symbolic music generation

audio machine-learning music music-generation music-information-retrieval python

Updated 10 months ago

musicaiz • Rank 9.9 • Science 67%

A python framework for symbolic music generation, evaluation and analysis

audio deep-learning machine-learning mir music music-generation music-information-retrieval python symbolic

Updated 10 months ago

aac-datasets • Rank 9.8 • Science 67%

Audio Captioning datasets for PyTorch.

audio audio-captioning caption captioning dataset datasets deep-learning pytorch

Updated 10 months ago

aac-metrics • Rank 9.6 • Science 67%

Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.

audio audio-captioning captioning metrics text

Updated 10 months ago

qdft • Rank 11.8 • Science 64%

Constant-Q Sliding DFT in C++, Rust and Python

algorithms audio audio-processing constant-q constant-q-transform cpp cqt dft digital-signal-processing dsp fft library python qdft rust sdft signal-processing sliding-dft variable-q vqt

Updated 10 months ago

pliers • Rank 14.0 • Science 54%

Automated feature extraction in Python

audio feature-extraction python video

Updated 9 months ago

https://github.com/fgnt/padertorch • Rank 13.5 • Science 54%

A collection of common functionality to simplify the design, training and evaluation of machine learning models based on pytorch with an emphasis on speech processing.

audio pytorch speech

Updated 10 months ago

dawdreamer • Rank 16.1 • Science 46%

Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, JAX, Warp Markers, and JUCE processors

ableton audio audio-plugin audio-processing daw faust jax juce midi python synthesizer vst vst-host vst3 vst3-host

Updated 10 months ago

hpss • Rank 3.8 • Science 57%

Harmonic/Percussive Sound Separation

audio audio-signal-processing hpss music-information-retrieval music-signal-processing signal-processing

Updated 9 months ago

https://github.com/fgnt/nara_wpe • Rank 18.7 • Science 41%

Different implementations of "Weighted Prediction Error" for speech dereverberation

audio audio-processing dereverberation enhancement signal-processing

Updated 9 months ago

https://github.com/fgnt/paderbox • Rank 15.6 • Science 44%

Paderbox: A collection of utilities for audio / speech processing

audio speech toolbox

Updated 10 months ago

sdft • Rank 3.6 • Science 54%

Single file forward and inverse Sliding DFT in C, C++ and Python

algorithms audio c99 cpp cpp11 dft digital-signal-processing discrete-fourier-transform dsp fft fourier fourier-transform math python qdft sdft signal-processing sliding-dft spectral-analysis spectral-synthesis

Updated 10 months ago

huggingsound • Rank 13.3 • Science 44%

HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools

asr audio automatic-speech-recognition speech speech-recognition speech-to-text transformers

Updated 10 months ago

https://github.com/csteinmetz1/auraloss • Rank 19.0 • Science 33%

Collection of audio-focused loss functions in PyTorch

audio loss-functions pytorch

Updated 10 months ago

cavasik • Rank 7.3 • Science 44%

Audio visualizer based on CAVA

audio audio-visualizer cava eye-candy flatpak gnome python

Updated 10 months ago

llama_ros • Rank 7.2 • Science 44%

llama.cpp (GGUF LLMs) and llava.cpp (GGUF VLMs) for ROS 2

audio cpp embeddings ggml gguf gpt langchain llama llamacpp llava llavacpp llm multimodal rerank reranking ros2 vlm

Updated 10 months ago

python-sounddevice • Rank 24.9 • Science 26%

:sound: Play and Record Sound with Python :snake:

audio portaudio python soundcard

Updated 10 months ago

audioread • Rank 27.3 • Science 23%

cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding for Python

audio python

Updated 10 months ago

audiotree • Rank 5.4 • Science 44%

Audio data loading and augmentations in JAX

audio audio-augmentation dataloader jax

Updated 10 months ago

audio_common • Rank 4.2 • Science 44%

A PortAudio based audio_common with text to speech for ROS 2

audio espeak pyaudio ros2 text-to-speech tts

Updated 10 months ago

dx7-jax • Rank 2.7 • Science 44%

Yamaha DX7 synthesizer with JAX

audio dx7 faust jax synthesizer

Updated 10 months ago

wav2vec • Rank 9.4 • Science 36%

Python package and cli tool to convert wave files (WAV or AIFF) to vector graphics (SVG, PostScript, CVS)

aiff audio cli graphics postscript python python3 svg wav waveform

Updated 10 months ago

https://github.com/bytedance/salmonn • Rank 9.0 • Science 36%

SALMONN family: A suite of advanced multi-modal LLMs

audio audio-processing audio-visual-understanding bytedance iclr2024 icml-2024 large-language-models multi-modal music research speech speech-recognition tsinghua-university video video-understanding

Updated 10 months ago

pyo • Rank 20.7 • Science 23%

Python DSP module

audio c dsp music pyo python radio-pyo signal-processing sound synthesis

Updated 10 months ago

audiodeepfakedetection • Rank 5.9 • Science 36%

SUTD 50.039 Deep Learning Course Project (2022 Spring)

audio audio-deepfake-detection deep-learning deepfake-detection

Updated 10 months ago

riffusion-hobby • Rank 10.0 • Science 31%

Stable diffusion for real-time music generation

ai audio diffusers diffusion music stable-diffusion

Updated 10 months ago

mosqito • Rank 14.8 • Science 26%

MoSQITo is a unified and modular development framework of key sound quality metrics favoring reproducible science and efficient shared scripting among engineers, teachers and researchers community.

acoustic audio audio-analysis python signal-processing sound-quality sq-metrics

Updated 10 months ago

https://github.com/dbraun/dac-jax • Rank 3.2 • Science 36%

JAX Implementations of Descript Audio Codec and EnCodec

audio audio-codec audio-compression jax machine-learning

Updated 10 months ago

https://github.com/dbraun/abletonparsing • Rank 9.8 • Science 26%

Parse an Ableton ASD clip file (warp markers and more) in Python

ableton audio python rubberband time-stretching warp-markers

Updated 10 months ago

https://github.com/100mslive/100ms-flutter • Rank 7.8 • Science 26%

Flutter Live Streaming, Video Conferencing SDK & Sample App

audio chat conference dart ffmpeg flutter hacktoberfest hls live meeting player rtmp streaming video webrtc

Updated 10 months ago

conformer • Rank 6.1 • Science 23%

Implementation of the convolutional module from the Conformer paper, for use in Transformers

artificial-intelligence audio deep-learning transformer

Updated 10 months ago

https://github.com/csteinmetz1/automix-toolkit • Rank 6.0 • Science 23%

Models and datasets for training deep learning automatic mixing models

audio automatic-mixing deep-learning music

Updated 10 months ago

macossystemsounds • Rank 2.2 • Science 26%

macOS system sounds in multiple formats (including Base64.)

audio audio-alerts base64 macos system-sounds

Updated 10 months ago

https://github.com/fl33tw00d/whisper-turbo • Rank 13.3 • Science 13%

Cross-Platform, GPU Accelerated Whisper 🏎️

audio machine-learning rust speech-recognition webgpu whisper windows

Updated 10 months ago

response • Rank 12.8 • Science 13%

Your handy frequency and impulse response processing object

audio signal-processing

Updated 10 months ago

fastai • Rank 6.9 • Science 13%

R interface to fast.ai

audio collaborative-filtering darknet darknet-image-classification fastai medical object-detection r tabular text vision

Updated 10 months ago

https://github.com/audiolabs/trackswitch.js • Rank 6.6 • Science 13%

A Versatile Web-Based Audio Player for Presenting Scientific Results

audio player research science

Updated 10 months ago

https://github.com/dbraun/td-faust • Rank 5.5 • Science 13%

FAUST (Functional Audio Stream) for TouchDesigner

audio audio-processing dsp faust touchdesigner

Updated 10 months ago

librosax • Science 44%

Librosa in JAX

audio flax jax signal-processing

Updated 10 months ago

https://github.com/dbraun/vita • Science 26%

Python bindings to the Vital Synthesizer

audio modular-synthesizers python synthesizer

Updated 10 months ago

birdnet-go • Science 26%

Realtime BirdNET soundscape analyzer

artificial-intelligence audio bioacoustics birdnet birdnet-pi birds birdweather contributions-welcome go golang linux raspberry-pi raspberrypi svelte tailwindcss tensorflow wildlife

Updated 10 months ago

avocodo • Science 26%

A MATLAB-based GUI that optimizes the experience in audio/video coding EEG data.

audio coding eeg video

Updated 10 months ago

seansaudiodb • Science 26%

The personal version of my audio collection database. Not intended for public use. See the other version that is intended for public use: https://github.com/seanpm2001/AudiBass_Manager

audio audio-collection-database audiodb collection database html ini markdown mid mkv mp3 mp4 music ogg plain-text sfx sound wiki wlmp xml

Updated 10 months ago

stipa • Science 44%

MATLAB implementation of the Speech Transmission Index for Public Address (STIPA) method for evaluating the speech transmission quality.

audio evaluation intelligibility psychoacoustics speech speech-transmission-index stipa

Updated 10 months ago

lhotse • Science 54%

Tools for handling multimodal data in machine learning projects.

ai audio data deep-learning kaldi machine-learning python pytorch speech speech-recognition

Updated 10 months ago

birdeep_birdsongdetector_neuralnetworks • Science 49%

Repository for the neural networks and models created for the BIRDeep project

audio bioacoustics bird-detector bird-identification bird-song-detector birdeep birdnet deep-learning donana huelva neural-network pam passive-accoustic-monitoring python spain yolo yolov8

Updated 10 months ago

roomfuser • Science 54%

Acoustic impulse response generation using diffusion models

acoustics audio diffusion-models machine-learning neural-networks

Updated 10 months ago

https://github.com/aveek-saha/duskplayer • Science 13%

A minimal music player built on electron.

angularjs audio electron electron-app hacktoberfest howlerjs mp3 music music-player node-js player playlist shuffle volume

Updated 10 months ago

https://github.com/neodsp/neolab • Science 26%

neolab is a framework to load, analyze and manipulate audio data

audio audio-analysis dsp research

Updated 10 months ago

https://github.com/aaltoml/nonstationary-audio-gp • Science 10%

End-to-End Probabilistic Inference for Nonstationary Audio Analysis

audio gaussian-processes probabilistic-models

Updated 10 months ago

quantumaudio • Science 67%

A Python package for building Quantum Representations of Digital Audio. Developed by Moth.

audio quantum-computing signal-processing

Updated 10 months ago

radtts-uk • Science 67%

High-fidelity speech synthesis for Ukrainian using modern neural networks.

audio sound speech-uk synthesis text-to-speech tts ukrainian vocos wav wave

Updated 10 months ago

https://github.com/cuc-zihang-liu/audioevaluation • Science 26%

多种音频评估方法复现

audio clap clip evaluation lpips rewas

Updated 10 months ago

laughter-detection-icsi • Science 36%

A ML-pipeline for training a laughter detection model on the ICSI corpus

ai audio data deep-learning laughter lhotse machine-learning ml pytorch

Updated 10 months ago

beep • Science 26%

Beep is a single, 12-line, 358-character AppleScript that plays a system notification sound for the numeric value of the current hour in quick succession. (Think grandfather clock.)

applescript audio macos

Updated 10 months ago

2d3mf • Science 36%

Code and models for the paper "2D3MF: Deepfake Detection using Multi Modal Middle Fusion"

audio deep-learning deepfake-detection machine-learning multimodal pytorch video

Updated 10 months ago

https://github.com/bkraad47/fat_llama • Science 13%

fat_llama is a Python package for upscaling audio files to FLAC or WAV formats using advanced audio processing techniques. It utilizes CUDA-accelerated calculations to enhance audio quality by upsampling and adding missing frequencies through FFT, resulting in richer and more detailed audio.

audio audio-engineering audio-processing audiophile cuda cufft cupy fft flac hi-res hpc mp3 music nvidia ogg parallel-computing physics upscaling wav

Updated 10 months ago

https://github.com/carlosholivan/audiolm-google-torch • Science 10%

Implementation of the AudioLM model by Google in Pytorch

audio audio-generation audio-synthesis deep-learning music music-generation sound soundprocessing synthesis vq-vae

Updated 10 months ago

https://github.com/dbraun/chuckdesigner • Science 13%

ChucK audio integration with TouchDesigner

audio audiovisual chuck music touchdesigner

Updated 10 months ago

https://github.com/dbraun/faust-tutorial • Science 26%

Multi-day Faust Tutorial (syntax, plugins, microcontrollers, pedals)

audio dsp faust music plugins

Updated 10 months ago

snac • Science 54%

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

audio audio-codec deep-learning

Updated 10 months ago

speechbrain • Science 64%

A PyTorch-based Speech Toolkit

asr audio audio-processing deep-learning huggingface language-model pytorch speaker-diarization speaker-recognition speaker-verification speech-enhancement speech-processing speech-recognition speech-separation speech-to-text speech-toolkit speechrecognition spoken-language-understanding transformers voice-recognition

Updated 10 months ago

bmt • Science 54%

Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)

activitynet-captions audio bi-modal-encoder bi-modal-transformer bmt bmvc bmvc20 dense-video-captioning i3d multimodal-fusion proposal-generator pytorch temporal-action-proposals transformer video video-features

Updated 10 months ago

https://github.com/csteinmetz1/ai-audio-startups • Science 13%

Community list of startups working with AI in audio and music technology

audio list music speech startups

Updated 10 months ago

anira • Science 57%

an architecture for neural network inference in real-time audio applications

audio audio-processing deep-learning libtorch onnxruntime real-time tensorflow-lite

Updated 10 months ago

nsynth-midi-renderer • Science 26%

Sample based concatenative synthesizer for the NSynth dataset. Render any MIDI (.mid) sequence with the notes of NSynth.

audio dataset maestro midi music nsynth nsynth-midi-renderer synthesizer

Updated 10 months ago

bird-song-detector • Science 49%

Bird Song Detector - Easy To Use Scripts and App

audio bioacoustics bird bird-detection birdsong deep deep-learning detector ecoinformatics gradio-interface learning neural-networks pam passive-acoustic-monitoring

Updated 10 months ago

eeg-cardio-audio-sleep • Science 13%

Project to study sound stimulus synchronous, asynchronous and isochronous with the heartbeat during sleep.

audio cardiac consciousness sleep

Updated 10 months ago

https://github.com/carlosholivan/audiogenerationdiffusion • Science 10%

State-of-the-art of Audio Generation with Diffusion Models

audio audio-generation deep-learning diffusion diffusion-models machine-learning multimodal-deep-learning

Updated 10 months ago

speech-utility-bioacoustics • Science 67%

On the utility of speech and audio foundation models for marmoset call analysis

audio bio-acoustics representation-learning self-supervised-learning speech

Updated 10 months ago

https://github.com/bluepixeldev/aftertone • Science 26%

A lightweight Unity plugin for audio pooling. This plugin aims to simplify playing throwaway abrupt sound effects such as gunshot, collision or UI sounds by utilizing a pool of prepared audio sources to reuse and play.

audio package pool pooling scriptableobject unity unity-audio unity-package unity3d-plugin unitypackage

Updated 10 months ago

open-in-mpv • Science 44%

Host-side of the extension to open any link or page URL in mpv via the browser context menu.

audio browser-extension mpv multimedia video

Updated 10 months ago

ltfat • Science 31%

Official development repository of the Large Time Frequency Analysis Toolbox

audio harmonic-analysis signal-processing

Updated 10 months ago

iossystemsounds • Science 26%

Sounds found in and extracted from System/Library/Audio/UISounds in multiple formats.

audio ios system-files system-sounds

Updated 10 months ago

microtonalview • Science 44%

Feel the untold tones—where notation falls short!

audio microtonal microtonalmusic music pitch pitch-contour pygame visualization

Updated 10 months ago

https://github.com/alexanderlerch/aca-slides • Science 26%

Slides and Code for "An Introduction to Audio Content Analysis," also taught at Georgia Tech as MUSI-6201. This introductory course on Music Information Retrieval is based on the text book "An Introduction to Audio Content Analysis", Wiley 2012/2022

audio audio-analysis audio-content-analysis audio-processing music music-information-retrieval

Updated 10 months ago

https://github.com/alexisvassquez/ai_spotibot_player • Science 26%

AudioMIX is an open-source, AI-driven music production software designed to empower independent artists and DJs with mood-based audio analysis, LED integration, and creative autonomy. Spotibot was its original name.

ai audio audio-analysis audio-processing audio-utility cmake cpp daw hardware machine-learning mood-detection music-production-software music-production-tools music-programming-language music-visualizer open-source portaudio python software-engineering spotipy-api

Updated 10 months ago

https://github.com/alexisvassquez/alexisvassquez • Science 26%

About me

audio creative-coding digital-design hardware modular-architecture music-production-software open-source-community software-engineer women-in-tech womenwhocode