rvadfast
This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (3.0%) to scientific vocabulary
Keywords
voice-activity-detection
Last synced: 6 months ago
·
JSON representation
·
Repository
This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.
Basic Info
- Host: GitHub
- Owner: zhenghuatan
- License: mit
- Language: Python
- Default Branch: master
- Homepage: https://www.sciencedirect.com/science/article/pii/S0885230819300920
- Size: 3.63 MB
Statistics
- Stars: 140
- Watchers: 9
- Forks: 24
- Open Issues: 0
- Releases: 0
Topics
voice-activity-detection
Created over 5 years ago
· Last pushed 9 months ago
Metadata Files
Readme
License
Citation
README.TXT
Fast noise-robust voice activity detection algorithm (rVAD-fast). Version 2.0 02 Dec 2017, Achintya Kumar Sarkar and Zheng-Hua Tan Usage: python rVAD_fast_2.0.py inWaveFile outputVadLabel Refs: [1] Z.-H. Tan, A.k. Sarkara and N. Dehak, "rVAD: an unsupervised segment-based robust voice activity detection method," Computer Speech and Language, vol. 59, pp. 1-21, 2020. [2] Z.-H. Tan and B. Lindberg, "Low-complexity variable frame rate analysis for speech recognition and voice activity detection,” IEEE Journal of Selected Topics in Signal Processing, vol. 4, no. 5, pp. 798-807, 2010. Contact: Prof Zheng-Hua Tan Aalborg University, Denmark zt@es.aau.dk https://vbn.aau.dk/en/persons/107665
Owner
- Name: Zheng-Hua Tan
- Login: zhenghuatan
- Kind: user
- Location: Aalborg, Denmark
- Company: Aalborg University
- Website: https://vbn.aau.dk/en/persons/107665
- Twitter: Z_H_Tan
- Repositories: 4
- Profile: https://github.com/zhenghuatan
Professor of Machine Learning and Speech Processing
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Tan"
given-names: "Zheng-Hua"
orcid: "https://orcid.org/0000-0001-6856-8928"
- family-names: "Sarkar"
given-names: "Achintya"
orcid: "https://orcid.org/0000-0002-9870-3980"
- family-names: "Najim"
given-names: "Dehak"
orcid: "https://orcid.org/0000-0002-4489-5753"
title: "rVADfast - a fast and robust unsupervised VAD"
version: 0.0.2
doi: 10.1016/j.csl.2019.06.005
date-released: 2024-01-23
url: "https://github.com/zhenghuatan/rVADfast"
license: MIT
preferred-citation:
type: article
authors:
- family-names: "Tan"
given-names: "Zheng-Hua"
orcid: "https://orcid.org/0000-0001-6856-8928"
- family-names: "Sartar"
given-names: "Achintya"
orcid: "https://orcid.org/0000-0002-9870-3980"
- family-names: "Najim"
given-names: "Dehak"
orcid: "https://orcid.org/0000-0002-4489-5753"
doi: "10.1016/j.csl.2019.06.005"
journal: "Computer Speech and Language"
start: 1 # First page number
end: 21 # Last page number
title: "rVAD: An unsupervised segment-based robust voice activity detection method"
volume: 59
year: 2020
GitHub Events
Total
- Issues event: 2
- Watch event: 14
- Delete event: 1
- Issue comment event: 3
- Push event: 12
- Pull request event: 6
- Fork event: 2
- Create event: 1
Last Year
- Issues event: 2
- Watch event: 14
- Delete event: 1
- Issue comment event: 3
- Push event: 12
- Pull request event: 6
- Fork event: 2
- Create event: 1
Dependencies
pyproject.toml
pypi
- audiofile >= 1.1.1
- numpy >= 1.23.5
- scipy >= 1.10.0
- tqdm >= 4.64.1