Scientific Software
Updated 9 months ago

Software Design and User Interface of ESPnet-SE++ — Peer-reviewed • Rank 25.3 • Science 95%

Software Design and User Interface of ESPnet-SE++: Speech Enhancement for Robust Speech Processing - Published in JOSS (2023)

Scientific Software
Updated 9 months ago

SpeechPy - A Library for Speech Processing and Recognition — Peer-reviewed • Rank 16.6 • Science 93%

SpeechPy - A Library for Speech Processing and Recognition - Published in JOSS (2018)

Mathematics
Scientific Software · Peer-reviewed
Scientific Software
Updated 9 months ago

audiomate — Peer-reviewed • Rank 12.8 • Science 95%

audiomate: A Python package for working with audio datasets - Published in JOSS (2020)

Artificial Intelligence and Machine Learning (40%)
Scientific Software · Peer-reviewed
Updated 9 months ago

transformers • Rank 38.7 • Science 64%

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Updated 9 months ago

huggingsound • Rank 13.3 • Science 44%

HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools

Updated 9 months ago

stt • Rank 21.4 • Science 33%

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

Updated 9 months ago

https://github.com/coqui-ai/stt-model-manager • Rank 12.1 • Science 36%

Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo

Updated 9 months ago

https://github.com/bagustris/speech-recognition-course • Rank 1.4 • Science 39%

Material for learning speech recognition, based on Microsoft teaching material on EdX

Updated 9 months ago

https://github.com/ccoreilly/vosk-browser • Rank 17.8 • Science 13%

A speech recognition library running in the browser thanks to a WebAssembly build of Vosk

Updated 9 months ago

https://github.com/bagustris/id • Rank 1.9 • Science 26%

Iban-based Kaldi recipe for Indonesian speech Corpus, presented at ASJ Spring 2019.

Updated 9 months ago

SpecAugment • Rank 8.5 • Science 10%

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

Updated 8 months ago

https://github.com/dcavar/elan2split • Rank 3.0 • Science 13%

Split ELAN Annotation Files and corresponding speech files into a corpus format for common ASR and Forced Aligners

Updated 9 months ago

speech-recognition • Science 36%

Develop speech recognition models with Tensorflow 2

Updated 9 months ago

pinyin-to-ipa • Science 67%

Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to the vowel of the syllable.

Updated 9 months ago

https://github.com/bagustris/book-ser • Science 26%

Codes for the book: Speech Emotion Recognition: Theory and Practice

Updated 9 months ago

lhotse • Science 54%

Tools for handling multimodal data in machine learning projects.

Updated 9 months ago

balena • Science 44%

BALanced Execution through Natural Activation : a human-computer interaction methodology for code running.

Updated 9 months ago

multilingual-asr • Science 54%

Multilingual Speech Recognition for Indonesian Languages

Updated 9 months ago

https://github.com/awslabs/speech-representations • Science 10%

Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)

Updated 9 months ago

https://github.com/audiollms/audiobench • Science 36%

AudioBench: A Universal Benchmark for Audio Large Language Models

Updated 9 months ago

https://github.com/awslabs/mlm-scoring • Science 10%

Python library & examples for Masked Language Model Scoring (ACL 2020)

Updated 9 months ago

speech-recognition-uk • Science 44%

🇺🇦 Speech Recognition & Synthesis for Ukrainian

Updated 9 months ago

cv10-uk-testset-clean • Science 44%

The cleaned Common Voice 10 (test set) that has been checked by a human for Ukrainian 🇺🇦

Updated 9 months ago

allophant • Science 44%

A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.

Updated 9 months ago

https://github.com/breandan/hello-robot • Science 13%

A speech interface for controlling our robot.