uyghur-multi-script-converter

This converter converts multiple Uyghur scripts: ULS(Uyghur Latin Script), UAS(Uyghur Arabic Script), CTS(Common Turkick Scritp), UCS(Uyghur Cyrilik Script) and Uyghur Yengi (new) Script.

https://github.com/neouyghur/uyghur-multi-script-converter

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (6.0%) to scientific vocabulary

Keywords

converter python script uyghur
Last synced: 4 months ago · JSON representation ·

Repository

This converter converts multiple Uyghur scripts: ULS(Uyghur Latin Script), UAS(Uyghur Arabic Script), CTS(Common Turkick Scritp), UCS(Uyghur Cyrilik Script) and Uyghur Yengi (new) Script.

Basic Info
  • Host: GitHub
  • Owner: neouyghur
  • License: apache-2.0
  • Language: Python
  • Default Branch: master
  • Homepage:
  • Size: 1.94 MB
Statistics
  • Stars: 51
  • Watchers: 3
  • Forks: 22
  • Open Issues: 3
  • Releases: 0
Topics
converter python script uyghur
Created almost 10 years ago · Last pushed 5 months ago
Metadata Files
Readme License Citation

README.md

Script Converter for Uyghur Language

This converter supports multiple Uyghur writing systems: - ULS Uyghur Latin Script
- UAS Uyghur Arabic Script
- CTS Common Turkish Script
- UCS Uyghur Cyrillic Script
- UYS Uyghur Yengi (New) Script
- IPA International Phonetic Alphabet
- UZLS Uzbek Latin Script
- XJUS Xinjiang University Script

Installation

pip install umsc

Mapping table

| UAS | CTS | ULS| UCS|UYS| IPA | UZLS | XJUS | |-----|----| ---- | --- | -- |-------|---|------| | | a | a | |a | // | o | a | | | e | e | | | // | a | A | | | b | b | |b | /b/ | b | b | | | p | p | |p | /p/ | p | p | | | t | t | |t | /t/ | t | t | | | c | j | |j | /d/ | j | j | | | | ch | |q | /t/ | ch | c | | | x | x | |h | // | x | H | | | d | d | |d | /d/ | d | d | | | r | r | |r | /r/ | r | r | | | z | z | |z | /z/ | z | z | | | j | zh | | | // | j | J | | | s | s | |s | /s/ | s | s | | | | sh | |x | // | sh | x | | | f | f | |f | /f/ | f | f | | | | ng | |ng | // | ng | N | | | l | l | |l | /l/ | l | l | | | m | m | |m | /m/ | m | m | | | h | h | | | /h/ | h | h | | | o | o | |o | /o/ | o | o | | | u | u | |u | /u/ | u | u | | | | | | | // | o | O | | | | | | | /y/ | u | U | | | v | w | |w | /w/ | v | w | | | | | |e | // | e | e | | | i | i | |i | /i/ | i | i | | | y | y | |y | /j/ | y | y | | | q | q | | | /q/ | q | q | | | k | k | |k | /k/ | k | k | | | g | g | |g | // | g | g | | | n | n | |n | /n/ | n | n | | | | gh | | | // | g | G | | | | | | | | | v | | | ya | ya | |ya | | ya | ya | | | yu | yu | |yu | | yu | yu |

Sample input and output examples

Review the files in the tests directory for examples of converting between different scripts.

Usage

``` from umsc import UgMultiScriptConverter

To convert text, you need to define source and target scripts

The abbreviation of scrips

ULS | Uyghur Latin Script

UYS | Uyghur Yengi (New) Script

CPS | Chinese Pinyin Script

UAS | Uyghur Arabic Script

CTS |Common Turkic Script

UCS | Uyghur Cyrillic Script

XJU | Xinjinag University English Case Sensitive

UZLS | Uzbek Latin Script

Convert Uyghur Arabic Script to Uyghur Latin Script

sourcescript = 'UAS' targetscript = 'ULS' converter = UgMultiScriptConverter(sourcescript, targetscript) text1 = '!' text1 = converter(text1) print(text1)

Convert Uyghur Latin Script to Uyghur Arabic Script

sourcescript = 'ULS' targetscript = 'UAS' converter = UgMultiScriptConverter(sourcescript, targetscript) text2 = 'yaxshimusiz!' text2 = converter(text2) print(text2) ```

Notes

  • API and Uzbek are currently developing. The mapping might not be very accurate. Especially for Uzbek, it is not very clear how to map "" and "" to Uzbek.

Citation

If you wish to cite this project, please use cite this repository.

Contributing

Feel free to raise issue and pull request.

License

Distributed under the Apache 2.0 License. See LICENSE for more information.

Owner

  • Name: Osman Tursun
  • Login: neouyghur
  • Kind: user
  • Location: Brisbane
  • Company: QUT SAIVT Lab.

Coding instead of begging

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: Osman
    given-names: Tursun
    orcid: https://orcid.org/0000-0002-0592-0864
title: "Uyghur Multi-Script Converter"
version: 1.0

GitHub Events

Total
  • Issues event: 4
  • Watch event: 10
  • Issue comment event: 8
  • Push event: 4
  • Pull request event: 1
  • Fork event: 2
Last Year
  • Issues event: 4
  • Watch event: 10
  • Issue comment event: 8
  • Push event: 4
  • Pull request event: 1
  • Fork event: 2

Dependencies

setup.py pypi