rvadfast

The Python library for a fast, unsupervised method for robust voice activity detection (rVAD), as described in the paper "rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method".

https://github.com/zhenghuatan/rvadfast

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (3.0%) to scientific vocabulary

Keywords

voice-activity-detection

Repository

Basic Info
Statistics
  • Stars: 140
  • Watchers: 9
  • Forks: 24
  • Open Issues: 0
  • Releases: 0
Topics
voice-activity-detection
Created over 5 years ago · Last pushed 9 months ago
Metadata Files
Readme · License · Citation

README.TXT

Fast noise-robust voice activity detection algorithm (rVAD-fast).
Version 2.0

02 Dec 2017, Achintya Kumar Sarkar and Zheng-Hua Tan

Usage: python rVAD_fast_2.0.py inWaveFile outputVadLabel
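
The usage line above can also be driven from Python, for example when many files need processing. The following is a minimal sketch, assuming the script rVAD_fast_2.0.py sits in the current working directory. The helper names build_rvad_command and run_rvad are hypothetical and not part of the repository.

```python
import subprocess
import sys
from pathlib import Path

# Script name taken from the usage line; its location is an assumption.
SCRIPT = "rVAD_fast_2.0.py"


def build_rvad_command(in_wav, out_label, script=SCRIPT):
    """Build the argument list for: python rVAD_fast_2.0.py inWaveFile outputVadLabel."""
    return [sys.executable, str(script), str(in_wav), str(out_label)]


def run_rvad(in_wav, out_label, script=SCRIPT):
    """Run the script on one file (hypothetical helper); raise if the run fails."""
    if not Path(in_wav).is_file():
        raise FileNotFoundError(in_wav)
    # check=True turns a non-zero exit status into CalledProcessError.
    subprocess.run(build_rvad_command(in_wav, out_label, script), check=True)
```

A call such as run_rvad("speech.wav", "speech.vad") would then correspond to the command line shown above; both file names are placeholders.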

Refs:
[1] Z.-H. Tan, A. K. Sarkar and N. Dehak, "rVAD: an unsupervised segment-based robust voice activity detection method," Computer Speech and Language, vol. 59, pp. 1-21, 2020.
[2] Z.-H. Tan and B. Lindberg, "Low-complexity variable frame rate analysis for speech recognition and voice activity detection," IEEE Journal of Selected Topics in Signal Processing, vol. 4, no. 5, pp. 798-807, 2010.

Contact:
  Prof Zheng-Hua Tan
  Aalborg University, Denmark
  zt@es.aau.dk
  https://vbn.aau.dk/en/persons/107665  

Owner

  • Name: Zheng-Hua Tan
  • Login: zhenghuatan
  • Kind: user
  • Location: Aalborg, Denmark
  • Company: Aalborg University

Professor of Machine Learning and Speech Processing

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Tan"
  given-names: "Zheng-Hua"
  orcid: "https://orcid.org/0000-0001-6856-8928"
- family-names: "Sarkar"
  given-names: "Achintya"
  orcid: "https://orcid.org/0000-0002-9870-3980"
- family-names: "Dehak"
  given-names: "Najim"
  orcid: "https://orcid.org/0000-0002-4489-5753"
title: "rVADfast - a fast and robust unsupervised VAD"
version: 0.0.2
doi: 10.1016/j.csl.2019.06.005
date-released: 2024-01-23
url: "https://github.com/zhenghuatan/rVADfast"
license: MIT
preferred-citation:
  type: article
  authors:
  - family-names: "Tan"
    given-names: "Zheng-Hua"
    orcid: "https://orcid.org/0000-0001-6856-8928"
  - family-names: "Sarkar"
    given-names: "Achintya"
    orcid: "https://orcid.org/0000-0002-9870-3980"
  - family-names: "Dehak"
    given-names: "Najim"
    orcid: "https://orcid.org/0000-0002-4489-5753"
  doi: "10.1016/j.csl.2019.06.005"
  journal: "Computer Speech and Language"
  start: 1 # First page number
  end: 21 # Last page number
  title: "rVAD: An unsupervised segment-based robust voice activity detection method"
  volume: 59
  year: 2020
  

GitHub Events

Total
  • Issues event: 2
  • Watch event: 14
  • Delete event: 1
  • Issue comment event: 3
  • Push event: 12
  • Pull request event: 6
  • Fork event: 2
  • Create event: 1
Last Year
  • Issues event: 2
  • Watch event: 14
  • Delete event: 1
  • Issue comment event: 3
  • Push event: 12
  • Pull request event: 6
  • Fork event: 2
  • Create event: 1

Dependencies

pyproject.toml pypi
  • audiofile >= 1.1.1
  • numpy >= 1.23.5
  • scipy >= 1.10.0
  • tqdm >= 4.64.1
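
The minimum versions listed above can be checked against the current environment before running the library. The following is a rough sketch using only the standard library. The helper names (version_tuple, is_older, check_requirements) are hypothetical, and the version parsing is deliberately simplified (it is not a full PEP 440 parser).

```python
from importlib import metadata

# Minimum versions as listed under pyproject.toml above.
REQUIRED = {
    "audiofile": "1.1.1",
    "numpy": "1.23.5",
    "scipy": "1.10.0",
    "tqdm": "4.64.1",
}


def version_tuple(version):
    """Keep the leading digits of each dot-separated part, e.g. '1.10.0rc1' -> (1, 10, 0)."""
    parts = []
    for piece in version.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break  # stop at the first part with no leading digits
        parts.append(int(digits))
    return tuple(parts)


def is_older(installed, minimum):
    """Compare two version strings after padding the shorter one with zeros."""
    a, b = version_tuple(installed), version_tuple(minimum)
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a < b


def check_requirements(required=REQUIRED):
    """Return {package: problem} for packages that are missing or too old."""
    problems = {}
    for name, minimum in required.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            problems[name] = "not installed"
            continue
        if is_older(installed, minimum):
            problems[name] = f"{installed} < {minimum}"
    return problems
```

Calling check_requirements() returns an empty dictionary when every listed package is installed at or above its minimum version.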