monotonic-alignment-search
Monotonically align text and speech
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.7%) to scientific vocabulary
Keywords
Repository
Monotonically align text and speech
Basic Info
Statistics
- Stars: 2
- Watchers: 2
- Forks: 0
- Open Issues: 0
- Releases: 2
Topics
Metadata Files
README.md
Monotonic Alignment Search (MAS)
Implementation of MAS from Glow-TTS for easy reuse in other projects.
Installation
bash
pip install monotonic-alignment-search
Wheels are provided for Linux, Mac, and Windows. Pytorch is not installed by default. You either first need to install it yourself, or install one of the following extras with uv:
bash
uv add monotonic-alignment-search[cpu]
uv add monotonic-alignment-search[cuda]
Usage
MAS can find the most probable alignment between a text sequence t_x and a
speech sequence t_y.
```python from monotonicalignmentsearch import maximum_path
value (torch.Tensor): [batchsize, tx, t_y]
mask (torch.Tensor): [batchsize, tx, t_y]
path = maximum_path(value, mask, implementation="cython") ```
The implementation argument allows choosing from one of the following
implementations:
cython(default): Cython-optimisednumpy: pure Numpy
References
This implementation is taken from the original Glow-TTS repository. Consider citing the Glow-TTS paper when using this project:
bibtex
@inproceedings{kim2020_glowtts,
title={Glow-{TTS}: A Generative Flow for Text-to-Speech via Monotonic Alignment Search},
author={Jaehyeon Kim and Sungwon Kim and Jungil Kong and Sungroh Yoon},
booktitle={Proceedings of Neur{IPS}},
year={2020},
}
Owner
- Name: Enno Hermann
- Login: eginhard
- Kind: user
- Location: Lausanne, Switzerland
- Company: @Idiap Research Institute
- Website: https://enno.xyz
- Repositories: 24
- Profile: https://github.com/eginhard
Postdoc @ Idiap Research Institute. ASR & TTS.
Citation (CITATION.cff)
# SPDX-FileCopyrightText: Enno Hermann
#
# SPDX-License-Identifier: MIT
cff-version: 1.2.0
message: "If you use this software, please cite it as below. Also consider citing the Glow-TTS paper."
title: "monotonic-alignment-search"
abstract: "Implementation of monotonic alignment search from Glow-TTS"
date-released: 2024
authors:
- family-names: Hermann
given-names: Enno
version: 0.1.1
license: "MIT"
url: "https://github.com/eginhard/monotonic_alignment_search"
repository-code: "https://github.com/eginhard/monotonic_alignment_search"
GitHub Events
Total
- Release event: 3
- Watch event: 1
- Delete event: 3
- Push event: 53
- Pull request event: 6
- Create event: 7
Last Year
- Release event: 3
- Watch event: 1
- Delete event: 3
- Push event: 53
- Pull request event: 6
- Create event: 7
Packages
- Total packages: 1
-
Total downloads:
- pypi 53,083 last-month
- Total dependent packages: 0
- Total dependent repositories: 0
- Total versions: 3
- Total maintainers: 1
pypi.org: monotonic-alignment-search
Monotonically align text and speech
- Documentation: https://monotonic-alignment-search.readthedocs.io/
- License: MIT
-
Latest release: 0.2.0
published 9 months ago
Rankings
Maintainers (1)
Dependencies
- actions/checkout v4 composite
- astral-sh/setup-uv v3 composite
- actions/checkout v4 composite
- actions/download-artifact v4 composite
- actions/upload-artifact v4 composite
- astral-sh/setup-uv v3 composite
- numpy >=1.21.6
- torch >=1.12.1
- actions/checkout v4 composite
- actions/download-artifact v4 composite
- actions/setup-python v5 composite
- actions/upload-artifact v4 composite
- pypa/cibuildwheel v2.21.3 composite
- pypa/gh-action-pypi-publish release/v1 composite