monotonic-alignment-search

Monotonically align text and speech

https://github.com/eginhard/monotonic_alignment_search

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (10.7%) to scientific vocabulary

Keywords

python speech speech-synthesis text-to-speech tts
Last synced: 6 months ago · JSON representation ·

Repository

Monotonically align text and speech

Basic Info
  • Host: GitHub
  • Owner: eginhard
  • License: mit
  • Language: Python
  • Default Branch: main
  • Homepage:
  • Size: 20.5 KB
Statistics
  • Stars: 2
  • Watchers: 2
  • Forks: 0
  • Open Issues: 0
  • Releases: 2
Topics
python speech speech-synthesis text-to-speech tts
Created over 1 year ago · Last pushed 9 months ago
Metadata Files
Readme License Citation

README.md

Monotonic Alignment Search (MAS)

PyPI - License PyPI - Python Version PyPI - Version GithubActions GithubActions

Implementation of MAS from Glow-TTS for easy reuse in other projects.

Installation

bash pip install monotonic-alignment-search

Wheels are provided for Linux, Mac, and Windows. Pytorch is not installed by default. You either first need to install it yourself, or install one of the following extras with uv:

bash uv add monotonic-alignment-search[cpu] uv add monotonic-alignment-search[cuda]

Usage

MAS can find the most probable alignment between a text sequence t_x and a speech sequence t_y.

```python from monotonicalignmentsearch import maximum_path

value (torch.Tensor): [batchsize, tx, t_y]

mask (torch.Tensor): [batchsize, tx, t_y]

path = maximum_path(value, mask, implementation="cython") ```

The implementation argument allows choosing from one of the following implementations:

  • cython (default): Cython-optimised
  • numpy: pure Numpy

References

This implementation is taken from the original Glow-TTS repository. Consider citing the Glow-TTS paper when using this project:

bibtex @inproceedings{kim2020_glowtts, title={Glow-{TTS}: A Generative Flow for Text-to-Speech via Monotonic Alignment Search}, author={Jaehyeon Kim and Sungwon Kim and Jungil Kong and Sungroh Yoon}, booktitle={Proceedings of Neur{IPS}}, year={2020}, }

Owner

  • Name: Enno Hermann
  • Login: eginhard
  • Kind: user
  • Location: Lausanne, Switzerland
  • Company: @Idiap Research Institute

Postdoc @ Idiap Research Institute. ASR & TTS.

Citation (CITATION.cff)

# SPDX-FileCopyrightText: Enno Hermann
#
# SPDX-License-Identifier: MIT

cff-version: 1.2.0
message: "If you use this software, please cite it as below. Also consider citing the Glow-TTS paper."
title: "monotonic-alignment-search"
abstract: "Implementation of monotonic alignment search from Glow-TTS"
date-released: 2024
authors:
  - family-names: Hermann
    given-names: Enno
version: 0.1.1
license: "MIT"
url: "https://github.com/eginhard/monotonic_alignment_search"
repository-code: "https://github.com/eginhard/monotonic_alignment_search"

GitHub Events

Total
  • Release event: 3
  • Watch event: 1
  • Delete event: 3
  • Push event: 53
  • Pull request event: 6
  • Create event: 7
Last Year
  • Release event: 3
  • Watch event: 1
  • Delete event: 3
  • Push event: 53
  • Pull request event: 6
  • Create event: 7

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 53,083 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 3
  • Total maintainers: 1
pypi.org: monotonic-alignment-search

Monotonically align text and speech

  • Versions: 3
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 53,083 Last month
Rankings
Dependent packages count: 10.1%
Average: 33.4%
Dependent repos count: 56.7%
Maintainers (1)
Last synced: 6 months ago

Dependencies

.github/workflows/lint.yml actions
  • actions/checkout v4 composite
  • astral-sh/setup-uv v3 composite
.github/workflows/test.yml actions
  • actions/checkout v4 composite
  • actions/download-artifact v4 composite
  • actions/upload-artifact v4 composite
  • astral-sh/setup-uv v3 composite
pyproject.toml pypi
  • numpy >=1.21.6
  • torch >=1.12.1
setup.py pypi
.github/workflows/pypi-release.yml actions
  • actions/checkout v4 composite
  • actions/download-artifact v4 composite
  • actions/setup-python v5 composite
  • actions/upload-artifact v4 composite
  • pypa/cibuildwheel v2.21.3 composite
  • pypa/gh-action-pypi-publish release/v1 composite