uyghur-multi-script-converter
This converter converts between multiple Uyghur scripts: ULS (Uyghur Latin Script), UAS (Uyghur Arabic Script), CTS (Common Turkic Script), UCS (Uyghur Cyrillic Script) and Uyghur Yengi (New) Script.
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ✓ CITATION.cff file: found
- ✓ codemeta.json file: found
- ✓ .zenodo.json file: found
- ○ DOI references
- ○ Academic publication links
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: low similarity (6.0%) to scientific vocabulary
Repository
Statistics
- Stars: 51
- Watchers: 3
- Forks: 22
- Open Issues: 3
- Releases: 0
Metadata Files
README.md
Script Converter for Uyghur Language
This converter supports multiple Uyghur writing systems:
- ULS Uyghur Latin Script
- UAS Uyghur Arabic Script
- CTS Common Turkic Script
- UCS Uyghur Cyrillic Script
- UYS Uyghur Yengi (New) Script
- IPA International Phonetic Alphabet
- UZLS Uzbek Latin Script
- XJUS Xinjiang University Script
Installation
pip install umsc
Mapping table
| UAS | CTS | ULS | UCS | UYS | IPA | UZLS | XJUS |
|-----|-----|-----|-----|-----|-----|------|------|
| | a | a | | a | // | o | a |
| | e | e | | | // | a | A |
| | b | b | | b | /b/ | b | b |
| | p | p | | p | /p/ | p | p |
| | t | t | | t | /t/ | t | t |
| | c | j | | j | /d/ | j | j |
| | | ch | | q | /t/ | ch | c |
| | x | x | | h | // | x | H |
| | d | d | | d | /d/ | d | d |
| | r | r | | r | /r/ | r | r |
| | z | z | | z | /z/ | z | z |
| | j | zh | | | // | j | J |
| | s | s | | s | /s/ | s | s |
| | | sh | | x | // | sh | x |
| | f | f | | f | /f/ | f | f |
| | | ng | | ng | // | ng | N |
| | l | l | | l | /l/ | l | l |
| | m | m | | m | /m/ | m | m |
| | h | h | | | /h/ | h | h |
| | o | o | | o | /o/ | o | o |
| | u | u | | u | /u/ | u | u |
| | | | | | // | o | O |
| | | | | | /y/ | u | U |
| | v | w | | w | /w/ | v | w |
| | | | | e | // | e | e |
| | i | i | | i | /i/ | i | i |
| | y | y | | y | /j/ | y | y |
| | q | q | | | /q/ | q | q |
| | k | k | | k | /k/ | k | k |
| | g | g | | g | // | g | g |
| | n | n | | n | /n/ | n | n |
| | | gh | | | // | g | G |
| | | | | | | | v |
| | ya | ya | | ya | | ya | ya |
| | yu | yu | | yu | | yu | yu |
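A table like this lends itself to a longest-match lookup, since several ULS letters are digraphs (`ch`, `sh`, `ng`, `zh`, `gh`). The sketch below is not umsc's implementation. It is a minimal illustration that hard-codes only the ULS and XJUS cells legible in the table where the two scripts differ, and passes every other character through unchanged. The names `DIGRAPHS`, `SINGLES` and `uls_to_xjus` are hypothetical.

```python
# Illustrative subset of ULS -> XJUS pairs copied from the mapping table.
# Assumption: letters not listed here are written the same way in both scripts.
DIGRAPHS = {"ch": "c", "zh": "J", "sh": "x", "ng": "N", "gh": "G"}
SINGLES = {"e": "A", "x": "H"}


def uls_to_xjus(text: str) -> str:
    """Convert lowercase ULS text to XJUS with a greedy longest-match scan."""
    out = []
    i = 0
    while i < len(text):
        pair = text[i:i + 2]
        if pair in DIGRAPHS:
            # Two-letter ULS sequence that maps to a single XJUS letter.
            out.append(DIGRAPHS[pair])
            i += 2
        else:
            ch = text[i]
            out.append(SINGLES.get(ch, ch))
            i += 1
    return "".join(out)


print(uls_to_xjus("yaxshimusiz!"))  # yaHximusiz!
```

A greedy scan treats every `n` followed by `g` as the single letter `ng`, so a real converter needs extra rules for sequences where the two letters are meant to be read separately.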
Sample input and output examples
Review the files in the tests directory for examples of converting between different scripts.
Usage
```python
from umsc import UgMultiScriptConverter

# To convert text, define the source and target scripts.
# Script abbreviations:
# ULS  | Uyghur Latin Script
# UYS  | Uyghur Yengi (New) Script
# CPS  | Chinese Pinyin Script
# UAS  | Uyghur Arabic Script
# CTS  | Common Turkic Script
# UCS  | Uyghur Cyrillic Script
# XJU  | Xinjiang University English Case Sensitive
# UZLS | Uzbek Latin Script

# Convert Uyghur Arabic Script to Uyghur Latin Script
source_script = 'UAS'
target_script = 'ULS'
converter = UgMultiScriptConverter(source_script, target_script)
text1 = '!'
text1 = converter(text1)
print(text1)

# Convert Uyghur Latin Script to Uyghur Arabic Script
source_script = 'ULS'
target_script = 'UAS'
converter = UgMultiScriptConverter(source_script, target_script)
text2 = 'yaxshimusiz!'
text2 = converter(text2)
print(text2)
```
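The constructor takes two script abbreviations, so a typo in either one is an easy mistake to make. Whether umsc validates these codes itself is not stated here. The helper below is a hypothetical pre-check against the abbreviations listed in the usage comments; `KNOWN_SCRIPTS` and `check_script_pair` are names invented for this sketch.

```python
# Abbreviations copied from the usage comments. Assumption: this set matches
# what the library accepts; it is not the library's authoritative list.
KNOWN_SCRIPTS = {"ULS", "UYS", "CPS", "UAS", "CTS", "UCS", "XJU", "UZLS"}


def check_script_pair(source: str, target: str) -> tuple:
    """Normalize and validate a (source, target) pair of script codes."""
    pair = (source.strip().upper(), target.strip().upper())
    for code in pair:
        if code not in KNOWN_SCRIPTS:
            raise ValueError(f"unknown script code: {code!r}")
    if pair[0] == pair[1]:
        raise ValueError("source and target scripts are the same")
    return pair


print(check_script_pair("uas", " ULS "))  # ('UAS', 'ULS')
```

The returned pair can then be unpacked into the constructor, for example `UgMultiScriptConverter(*pair)`.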
Notes
- The IPA and Uzbek mappings are still under development and may not be fully accurate. For Uzbek in particular, it is not yet clear how to map "" and "" to Uzbek.
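One way to see where a mapping loses information is to look for target letters shared by several source letters. The sketch below does this for a handful of ULS to UZLS pairs read off the mapping table. The dictionary is a hand-copied subset, not data exported from umsc, and `find_collisions` is a hypothetical helper.

```python
from collections import defaultdict

# Hand-copied subset of ULS -> UZLS pairs from the mapping table.
# Assumption: the table reflects the current mapping; nothing is loaded from umsc.
ULS_TO_UZLS = {
    "a": "o",
    "e": "a",
    "o": "o",
    "u": "u",
    "g": "g",
    "gh": "g",
    "w": "v",
}


def find_collisions(mapping: dict) -> dict:
    """Return target letters that more than one source letter maps to."""
    sources = defaultdict(list)
    for src, dst in mapping.items():
        sources[dst].append(src)
    return {dst: srcs for dst, srcs in sources.items() if len(srcs) > 1}


print(find_collisions(ULS_TO_UZLS))  # {'o': ['a', 'o'], 'g': ['g', 'gh']}
```

Any letter reported here cannot be recovered unambiguously when converting back from Uzbek Latin Script.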
Citation
If you wish to cite this project, please use the "Cite this repository" option.
Contributing
Feel free to open an issue or submit a pull request.
License
Distributed under the Apache 2.0 License. See LICENSE for more information.
Owner
- Name: Osman Tursun
- Login: neouyghur
- Kind: user
- Location: Brisbane
- Company: QUT SAIVT Lab.
- Repositories: 49
- Profile: https://github.com/neouyghur
Coding instead of begging
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: Osman
    given-names: Tursun
    orcid: https://orcid.org/0000-0002-0592-0864
title: "Uyghur Multi-Script Converter"
version: 1.0
GitHub Events
Total
- Issues event: 4
- Watch event: 10
- Issue comment event: 8
- Push event: 4
- Pull request event: 1
- Fork event: 2
Last Year
- Issues event: 4
- Watch event: 10
- Issue comment event: 8
- Push event: 4
- Pull request event: 1
- Fork event: 2