waveglow-cli

Command-line interface (CLI) to train WaveGlow using .wav files.

https://github.com/stefantaubert/waveglow-cli

Science Score: 67.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 7 DOI reference(s) in README
  • Academic publication links
    Links to: ieee.org, zenodo.org
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.7%) to scientific vocabulary
Last synced: 6 months ago

Repository

Command-line interface (CLI) to train WaveGlow using .wav files.

Basic Info
  • Host: GitHub
  • Owner: stefantaubert
  • License: MIT
  • Language: Python
  • Default Branch: master
  • Homepage:
  • Size: 834 KB
Statistics
  • Stars: 1
  • Watchers: 1
  • Forks: 0
  • Open Issues: 1
  • Releases: 1
Created almost 5 years ago · Last pushed over 1 year ago
Metadata Files
Readme Changelog Contributing License Code of conduct Citation

README.md

waveglow-cli


Command-line interface (CLI) to train WaveGlow using .wav files.

Features

  • train/synthesize on CPU or GPU
  • download pre-trained models by Nvidia

Installation

```sh
pip install waveglow-cli --user
```
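To confirm that the installation step worked, the installed version can be queried from Python. This is a minimal sketch using only the standard library; the helper name `installed_version` is hypothetical and not part of waveglow-cli.

```python
from importlib import metadata
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # "waveglow-cli" is the distribution name used in the pip command above.
    version = installed_version("waveglow-cli")
    if version is None:
        print("waveglow-cli is not installed in this environment")
    else:
        print(f"waveglow-cli {version} is installed")
```

Returning `None` instead of raising keeps the check usable in setup scripts that should continue when the package is missing.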

Usage

```txt
usage: waveglow-cli [-h] [-v] {download,train,continue-train,validate,synthesize,synthesize-wav} ...

This program trains WaveGlow.

positional arguments:
  {download,train,continue-train,validate,synthesize,synthesize-wav}
                        description
    download            download pre-trained checkpoint from Nvidia
    train               start training
    continue-train      continue training
    validate            validate checkpoint(s)
    synthesize          synthesize mel-spectrograms into an audio signal
    synthesize-wav      synthesize audio file into an audio signal

options:
  -h, --help            show this help message and exit
  -v, --version         show program's version number and exit
```
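The help text above has the shape that Python's `argparse` produces for a parser with subcommands. The following sketch shows how such a layout could be declared. It is an illustration only, not the project's actual source: `build_parser`, `SUBCOMMANDS`, and the version placeholder are assumptions, and the real subcommands accept further arguments that are not listed here.

```python
import argparse

# Subcommand names and help strings are copied from the usage text above.
SUBCOMMANDS = {
    "download": "download pre-trained checkpoint from Nvidia",
    "train": "start training",
    "continue-train": "continue training",
    "validate": "validate checkpoint(s)",
    "synthesize": "synthesize mel-spectrograms into an audio signal",
    "synthesize-wav": "synthesize audio file into an audio signal",
}


def build_parser() -> argparse.ArgumentParser:
    """Build a parser that mirrors the documented top-level interface."""
    parser = argparse.ArgumentParser(
        prog="waveglow-cli",
        description="This program trains WaveGlow.",
    )
    # Placeholder version string; the real tool reports its own version.
    parser.add_argument(
        "-v", "--version", action="version", version="%(prog)s (placeholder)"
    )
    subparsers = parser.add_subparsers(
        dest="command", required=True, help="description"
    )
    for name, help_text in SUBCOMMANDS.items():
        subparsers.add_parser(name, help=help_text)
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args(["train"])
    print(args.command)
```

Each subparser created this way gets its own `-h` option, so in an argparse-based CLI per-command help is typically available as `<command> -h`.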

Pretrained Models

Audio Example

"The North Wind and the Sun were disputing which was the stronger, when a traveler came along wrapped in a warm cloak." Listen here (headphones recommended)

Roadmap

  • Outsource method to convert audio files to mel-spectrograms before training
  • Improve logging
  • Add more audio examples
  • Add tests

Development setup

```sh
# update
sudo apt update

# install Python 3.8-3.11 for ensuring that tests can be run
sudo apt install python3-pip \
  python3.8 python3.8-dev python3.8-distutils python3.8-venv \
  python3.9 python3.9-dev python3.9-distutils python3.9-venv \
  python3.10 python3.10-dev python3.10-distutils python3.10-venv \
  python3.11 python3.11-dev python3.11-distutils python3.11-venv

# install pipenv for creation of virtual environments
python3.8 -m pip install pipenv --user

# check out repo
git clone https://github.com/stefantaubert/waveglow.git
cd waveglow

# create virtual environment
python3.8 -m pipenv install --dev
```

Running the tests

```sh
# first install the tool as described in "Development setup"
# then navigate into the directory of the repo (if not already done)
cd waveglow

# activate environment
python3.8 -m pipenv shell

# run tests
tox
```

Final lines of test result output:

```log
py38: commands succeeded
py39: commands succeeded
py310: commands succeeded
py311: commands succeeded
congratulations :)
```

License

MIT License

Acknowledgments

Model code adapted from Nvidia.

Paper:

Funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – Project-ID 416228727 – CRC 1410

Citation

If you want to cite this repo, you can use the BibTeX entry generated by GitHub (see About => Cite this repository).

```txt
Taubert, S. (2024). waveglow-cli (Version 0.0.2) [Computer software]. https://doi.org/10.5281/zenodo.10569141
```

Owner

  • Name: Stefan Taubert
  • Login: stefantaubert
  • Kind: user
  • Location: Chemnitz, Germany
  • Company: Chemnitz University of Technology

I am currently working on my PhD on speech synthesis at Chemnitz University of Technology.

Citation (CITATION.cff)

cff-version: 1.2.0
title: waveglow-cli
abstract: Command-line interface (CLI) to train WaveGlow using .wav files.
message: >-
  If you use this software, please cite it using the
  metadata from this file.
type: software
authors:
  - email: github@stefantaubert.com
    given-names: Stefan
    family-names: Taubert
    affiliation: Chemnitz University of Technology
    orcid: 'https://orcid.org/0000-0002-4932-2874'
    website: 'https://stefantaubert.com/'
version: 0.0.2
date-released: 2024-01-25
license: MIT
url: https://github.com/stefantaubert/waveglow
doi: 10.5281/zenodo.10569141

Committers

Last synced: 9 months ago

All Time
  • Total Commits: 70
  • Total Committers: 1
  • Avg Commits per committer: 70.0
  • Development Distribution Score (DDS): 0.0
Past Year
  • Commits: 1
  • Committers: 1
  • Avg Commits per committer: 1.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Stefan Taubert s****t@p****e 70

Issues and Pull Requests

Last synced: 10 months ago

All Time
  • Total issues: 1
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 1
  • Total pull request authors: 0
  • Average comments per issue: 1.0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • aleph1 (1)

Dependencies

Pipfile pypi
  • autoflake * develop
  • autopep8 * develop
  • isort * develop
  • pycodestyle * develop
  • pylint * develop
  • pytest * develop
  • rope * develop
  • tox * develop
  • twine * develop
  • waveglow * develop
  • Pillow *
  • Unidecode *
  • fastdtw *
  • gdown *
  • imageio *
  • librosa *
  • matplotlib *
  • mel-cepstral-distance >=0.0.1
  • numpy *
  • ordered-set >=4.1.0
  • pandas *
  • plotly *
  • scikit-image *
  • scikit-learn *
  • scipy *
  • tensorboard *
  • torch *
  • tqdm *
  • wget *
Pipfile.lock pypi
  • 163 dependencies
pyproject.toml pypi
  • Pillow *
  • Unidecode *
  • fastdtw *
  • gdown *
  • librosa *
  • matplotlib *
  • mel-cepstral-distance >=0.0.1
  • numpy *
  • ordered_set >=4.1.0
  • pandas *
  • plotly *
  • scikit-image *
  • scikit-learn *
  • scipy *
  • torch *
  • tqdm *
  • wget *
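The Pipfile and pyproject.toml lists above are nearly identical, and one apparent difference (`ordered-set` versus `ordered_set`) is only a spelling variant. The sketch below compares the two lists after normalizing project names in the way PEP 503 describes. The package names are copied from the lists above (non-develop Pipfile entries only); the function names are hypothetical.

```python
import re


def normalize(name: str) -> str:
    """Normalize a project name as described in PEP 503."""
    return re.sub(r"[-_.]+", "-", name).lower()


# Runtime (non-develop) packages listed under "Pipfile".
PIPFILE_PACKAGES = [
    "Pillow", "Unidecode", "fastdtw", "gdown", "imageio", "librosa",
    "matplotlib", "mel-cepstral-distance", "numpy", "ordered-set", "pandas",
    "plotly", "scikit-image", "scikit-learn", "scipy", "tensorboard",
    "torch", "tqdm", "wget",
]

# Packages listed under "pyproject.toml".
PYPROJECT_PACKAGES = [
    "Pillow", "Unidecode", "fastdtw", "gdown", "librosa", "matplotlib",
    "mel-cepstral-distance", "numpy", "ordered_set", "pandas", "plotly",
    "scikit-image", "scikit-learn", "scipy", "torch", "tqdm", "wget",
]


def only_in_first(first, second):
    """Return names from `first` that have no normalized match in `second`."""
    second_normalized = {normalize(name) for name in second}
    return sorted(name for name in first if normalize(name) not in second_normalized)


print(only_in_first(PIPFILE_PACKAGES, PYPROJECT_PACKAGES))  # ['imageio', 'tensorboard']
print(only_in_first(PYPROJECT_PACKAGES, PIPFILE_PACKAGES))  # []
```

With normalization applied, `ordered-set` and `ordered_set` count as the same project, so only `imageio` and `tensorboard` appear in the Pipfile without a pyproject.toml counterpart.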