Updated 9 months ago
mexca
Multimodal Emotion eXpression Capture Amsterdam. Pipeline for capturing emotion expressions from multiple modalities (video, audio, text) in the wild.
Updated 9 months ago
platalea
Library for training visually-grounded models of spoken language understanding.
Updated 9 months ago
speechbrain
A PyTorch-based Speech Toolkit
asr
audio
audio-processing
deep-learning
huggingface
language-model
pytorch
speaker-diarization
speaker-recognition
speaker-verification
speech-enhancement
speech-processing
speech-recognition
speech-separation
speech-to-text
speech-toolkit
speechrecognition
spoken-language-understanding
transformers
voice-recognition