rvadfast

This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.

https://github.com/zhenghuatan/rvadfast

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (3.0%) to scientific vocabulary

Keywords

voice-activity-detection

Last synced: 10 months ago · JSON representation ·

Repository

This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.

Basic Info

Host: GitHub
Owner: zhenghuatan
License: mit
Language: Python
Default Branch: master
Homepage: https://www.sciencedirect.com/science/article/pii/S0885230819300920
Size: 3.63 MB

Statistics

Stars: 140
Watchers: 9
Forks: 24
Open Issues: 0
Releases: 0

Topics

voice-activity-detection

Created almost 6 years ago · Last pushed about 1 year ago

Metadata Files

Readme License Citation

README.TXT

Fast noise-robust voice activity detection algorithm (rVAD-fast).
Version 2.0

02 Dec 2017, Achintya Kumar Sarkar and Zheng-Hua Tan

Usage: python rVAD_fast_2.0.py inWaveFile  outputVadLabel

Refs:
[1] Z.-H. Tan, A.k. Sarkara and N. Dehak, "rVAD: an unsupervised segment-based robust voice activity detection method," Computer Speech and Language, vol. 59, pp. 1-21, 2020. 
[2] Z.-H. Tan and B. Lindberg, "Low-complexity variable frame rate analysis for speech recognition and voice activity detection,” IEEE Journal of Selected Topics in Signal Processing, vol. 4, no. 5, pp. 798-807, 2010.

Contact:
  Prof Zheng-Hua Tan
  Aalborg University, Denmark
  zt@es.aau.dk
  https://vbn.aau.dk/en/persons/107665

Owner

Name: Zheng-Hua Tan
Login: zhenghuatan
Kind: user
Location: Aalborg, Denmark
Company: Aalborg University

Website: https://vbn.aau.dk/en/persons/107665
Twitter: Z_H_Tan
Repositories: 4
Profile: https://github.com/zhenghuatan

Professor of Machine Learning and Speech Processing

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Tan"
  given-names: "Zheng-Hua"
  orcid: "https://orcid.org/0000-0001-6856-8928"
- family-names: "Sarkar"
  given-names: "Achintya"
  orcid: "https://orcid.org/0000-0002-9870-3980"
- family-names: "Najim"
  given-names: "Dehak"
  orcid: "https://orcid.org/0000-0002-4489-5753"
title: "rVADfast - a fast and robust unsupervised VAD"
version: 0.0.2
doi: 10.1016/j.csl.2019.06.005
date-released: 2024-01-23
url: "https://github.com/zhenghuatan/rVADfast"
license: MIT
preferred-citation:
  type: article
  authors:
  - family-names: "Tan"
    given-names: "Zheng-Hua"
    orcid: "https://orcid.org/0000-0001-6856-8928"
  - family-names: "Sartar"
    given-names: "Achintya"
    orcid: "https://orcid.org/0000-0002-9870-3980"
  - family-names: "Najim"
    given-names: "Dehak"
    orcid: "https://orcid.org/0000-0002-4489-5753"
  doi: "10.1016/j.csl.2019.06.005"
  journal: "Computer Speech and Language"
  start: 1 # First page number
  end: 21 # Last page number
  title: "rVAD: An unsupervised segment-based robust voice activity detection method"
  volume: 59
  year: 2020

GitHub Events

Total

Issues event: 2
Watch event: 14
Delete event: 1
Issue comment event: 3
Push event: 12
Pull request event: 6
Fork event: 2
Create event: 1

Last Year

Issues event: 2
Watch event: 14
Delete event: 1
Issue comment event: 3
Push event: 12
Pull request event: 6
Fork event: 2
Create event: 1

Dependencies

pyproject.toml pypi

audiofile >= 1.1.1
numpy >= 1.23.5
scipy >= 1.10.0
tqdm >= 4.64.1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science