dict-from-dragonmapper
CLI to convert a Chinese vocabulary to IPA using dragonmapper.
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Committers with academic emails
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (8.0%) to scientific vocabulary
Repository
CLI to convert a Chinese vocabulary to IPA using dragonmapper.
Basic Info
- Host: GitHub
- Owner: stefantaubert
- License: mit
- Language: Python
- Default Branch: master
- Size: 113 KB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 2
Metadata Files
README.md
dict-from-dragonmapper
Command-line interface (CLI) to create a pronunciation dictionary by looking up IPA transcriptions using dragonmapper including the possibility of ignoring punctuation and splitting words on hyphens before transcribing them. Instead of attaching the tones to the syllable the tones are attached to the vowel of the syllable.
Installation
sh
pip install dict-from-dragonmapper --user
Usage
sh
dict-from-dragonmapper-cli
Example
```sh
Create example vocabulary
cat > /tmp/vocabulary.txt << EOF 社会语言学? 㐻, 『㑐 鲜-亮。 『占斌? 『机具-机呀? EOF
Create dictionary from vocabulary
dict-from-dragonmapper-cli \ /tmp/vocabulary.txt \ /tmp/result.dict \ --split-on-hyphen \ --n-jobs 4
cat /tmp/result.dict ```
Output:
txt
社会语言学? ʂ ɤ˥˩ x w eɪ˥˩ y˧˩˧ j ɛ˧˥ n ɕ ɥ œ˧˥ ?
㐻, n eɪ˥˩ ,
『㑐 『 ʂ u˥
鲜-亮。 ɕ j ɛ˥ n - l j ɑ˥˩ ŋ 。
『占斌? 『 ʈʂ a˥˩ n p i˥ n ?
『机具-机呀? 『 tɕ i˥ tɕ y˥˩ - tɕ i˥ j a ?
Phoneme Set
txt
a
aɪ
eɪ
f
i
j
k
kʰ
l
m
n
oʊ
p
pʰ
s
t
ts
tsʰ
tɕ
tɕʰ
tʰ
u
w
x
y
ŋ
œ
ɑ
ɑʊ
ɔ
ɕ
ə
ɛ
ɤ
ɥ
ɨ
ɯ
ɻ
ʂ
ʈʂ
ʈʂʰ
ʊ
ʐ
Vowels and diphthongs contain one of these tones:
txt
˥ (first tone)
˧˥ (second tone)
˧˩˧ (third tone)
˥˩ (fourth tone)
(none)
Dependencies
pronunciation-dictionary >= 0.0.4ordered-set >= 4.1.0word-to-pronunciation >= 0.0.1dragonmapper >=0.2.6, < 0.3tqdm
License
MIT License
Acknowledgments
Funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – Project-ID 416228727 – CRC 1410
Citation
If you want to cite this repo, you can use this BibTeX-entry generated by GitHub (see About => Cite this repository).
Owner
- Name: Stefan Taubert
- Login: stefantaubert
- Kind: user
- Location: Chemnitz, Germany
- Company: Chemnitz University of Technology
- Website: https://stefantaubert.com
- Twitter: Stefan_Taubert
- Repositories: 75
- Profile: https://github.com/stefantaubert
Currently I am working on my PhD about the topic of speech synthesis at Chemnitz University of Technology.
Citation (CITATION.cff)
cff-version: 1.2.0
title: dict-from-dragonmapper
abstract: Command-line interface (CLI) to create a pronunciation dictionary by looking up IPA transcriptions using dragonmapper including the possibility of ignoring punctuation and splitting words on hyphens before transcribing them.
message: >-
If you use this software, please cite it using the
metadata from this file.
type: software
authors:
- email: github@stefantaubert.com
given-names: Stefan
family-names: Taubert
affiliation: Chemnitz University of Technology
orcid: 'https://orcid.org/0000-0002-4932-2874'
website: 'https://stefantaubert.com'
version: 0.0.2
date-released: 2022-09-08
license: MIT
url: https://github.com/stefantaubert/dict-from-dragonmapper
doi:
GitHub Events
Total
Last Year
Issues and Pull Requests
Last synced: 11 months ago
All Time
- Total issues: 0
- Total pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 0
- Total pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels
Packages
- Total packages: 1
-
Total downloads:
- pypi 10 last-month
- Total dependent packages: 0
- Total dependent repositories: 1
- Total versions: 2
- Total maintainers: 1
pypi.org: dict-from-dragonmapper
Command-line interface (CLI) to create a pronunciation dictionary by looking up IPA transcriptions using dragonmapper including the possibility of ignoring punctuation and splitting words on hyphens before transcribing them.
- Homepage: https://github.com/stefantaubert/dict-from-dragonmapper
- Documentation: https://dict-from-dragonmapper.readthedocs.io/
- License: MIT
-
Latest release: 0.0.2
published almost 4 years ago
Rankings
Maintainers (1)
Dependencies
- autoflake * develop
- autopep8 * develop
- build * develop
- dict-from-dragonmapper * develop
- isort * develop
- pycodestyle * develop
- pylint * develop
- pytest * develop
- rope * develop
- tox * develop
- twine * develop
- dragonmapper >=0.2.6
- ordered-set >=4.1.0
- pronunciation-dictionary >=0.0.4
- tqdm *
- word-to-pronunciation >=0.0.1
- astroid ==2.12.9 develop
- attrs ==22.1.0 develop
- autoflake ==1.5.3 develop
- autopep8 ==1.7.0 develop
- bleach ==5.0.1 develop
- build ==0.8.0 develop
- certifi ==2022.6.15 develop
- cffi ==1.15.1 develop
- charset-normalizer ==2.1.1 develop
- commonmark ==0.9.1 develop
- cryptography ==38.0.1 develop
- dict-from-dragonmapper * develop
- dill ==0.3.5.1 develop
- distlib ==0.3.6 develop
- docutils ==0.19 develop
- dragonmapper ==0.2.6 develop
- filelock ==3.8.0 develop
- hanzidentifier ==1.0.2 develop
- idna ==3.3 develop
- importlib-metadata ==4.12.0 develop
- iniconfig ==1.1.1 develop
- isort ==5.10.1 develop
- jaraco.classes ==3.2.2 develop
- jeepney ==0.8.0 develop
- keyring ==23.9.1 develop
- lazy-object-proxy ==1.7.1 develop
- mccabe ==0.7.0 develop
- more-itertools ==8.14.0 develop
- ordered-set ==4.1.0 develop
- packaging ==21.3 develop
- pep517 ==0.13.0 develop
- pkginfo ==1.8.3 develop
- platformdirs ==2.5.2 develop
- pluggy ==1.0.0 develop
- pronunciation-dictionary ==0.0.4 develop
- py ==1.11.0 develop
- pycodestyle ==2.9.1 develop
- pycparser ==2.21 develop
- pyflakes ==2.5.0 develop
- pygments ==2.13.0 develop
- pylint ==2.15.0 develop
- pyparsing ==3.0.9 develop
- pytest ==7.1.3 develop
- pytoolconfig ==1.2.2 develop
- readme-renderer ==37.1 develop
- requests ==2.28.1 develop
- requests-toolbelt ==0.9.1 develop
- rfc3986 ==2.0.0 develop
- rich ==12.5.1 develop
- rope ==1.3.0 develop
- secretstorage ==3.3.3 develop
- six ==1.16.0 develop
- toml ==0.10.2 develop
- tomli ==2.0.1 develop
- tomlkit ==0.11.4 develop
- tox ==3.25.1 develop
- tqdm ==4.64.1 develop
- twine ==4.0.1 develop
- typing-extensions ==4.3.0 develop
- urllib3 ==1.26.12 develop
- virtualenv ==20.16.4 develop
- webencodings ==0.5.1 develop
- word-to-pronunciation ==0.0.1 develop
- wrapt ==1.14.1 develop
- zhon ==1.1.5 develop
- zipp ==3.8.1 develop
- dragonmapper ==0.2.6
- hanzidentifier ==1.0.2
- ordered-set ==4.1.0
- pronunciation-dictionary ==0.0.4
- tqdm ==4.64.1
- word-to-pronunciation ==0.0.1
- zhon ==1.1.5
- dragonmapper >=0.2.6, < 0.3
- ordered-set >= 4.1.0
- pronunciation-dictionary >= 0.0.4
- tqdm *
- word-to-pronunciation >= 0.0.1