jaconv

Pure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku, and Zenkaku

https://github.com/ikegami-yukino/jaconv

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (5.9%) to scientific vocabulary

Keywords

character-converter japanese-kana japanese-language julius preprocessing pure-python text-processing transliteration
Last synced: 6 months ago · JSON representation

Repository

Pure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku, and Zenkaku

Basic Info
Statistics
  • Stars: 331
  • Watchers: 10
  • Forks: 32
  • Open Issues: 7
  • Releases: 0
Topics
character-converter japanese-kana japanese-language julius preprocessing pure-python text-processing transliteration
Created almost 10 years ago · Last pushed 7 months ago
Metadata Files
Readme Changelog Funding License

README.rst

jaconv
==========
|coveralls| |pyversion| |version| |license| |download|

jaconv (Japanese Converter) is interconverter for Hiragana, Katakana, Hankaku (half-width character) and Zenkaku (full-width character)

`Japanese README `_ is available.

INSTALLATION
==============

::

 $ pip install jaconv


USAGE
============

See also `document `_

.. code:: python

  import jaconv

  # Hiragana to Katakana
  jaconv.hira2kata('ともえまみ')
  # => 'トモエマミ'

  # Hiragana to half-width Katakana
  jaconv.hira2hkata('ともえまみ')
  # => 'トモエマミ'

  # Katakana to Hiragana
  jaconv.kata2hira('巴マミ')
  # => '巴まみ'

  # half-width character to full-width character
  # default parameters are followings: kana=True, ascii=False, digit=False
  jaconv.h2z('ティロ・フィナーレ')
  # => 'ティロ・フィナーレ'

  # half-width character to full-width character
  # but only ascii characters
  jaconv.h2z('abc', kana=False, ascii=True, digit=False)
  # => 'abc'

  # half-width character to full-width character
  # but only digit characters
  jaconv.h2z('123', kana=False, ascii=False, digit=True)
  # => '123'

  # half-width character to full-width character
  # except half-width Katakana
  jaconv.h2z('アabc123', kana=False, digit=True, ascii=True)
  # => 'アabc123'

  # an alias of h2z
  jaconv.hankaku2zenkaku('ティロ・フィナーレabc123')
  # => 'ティロ・フィナーレabc123'

  # full-width character to half-width character
  # default parameters are followings: kana=True, ascii=False, digit=False
  jaconv.z2h('ティロ・フィナーレ')
  # => 'ティロ・フィナーレ'

  # full-width character to half-width character
  # but only ascii characters
  jaconv.z2h('abc', kana=False, ascii=True, digit=False)
  # => 'abc'

  # full-width character to half-width character
  # but only digit characters
  jaconv.z2h('123', kana=False, ascii=False, digit=True)
  # => '123'

  # full-width character to half-width character
  # except full-width Katakana
  jaconv.z2h('アabc123', kana=False, digit=True, ascii=True)
  # => 'アabc123'

  # an alias of z2h
  jaconv.zenkaku2hankaku('ティロ・フィナーレabc123')
  # => 'ティロ・フィナーレabc123'

  # normalize
  jaconv.normalize('ティロ・フィナ〜レ', 'NFKC')
  # => 'ティロ・フィナーレ'

  # Hiragana to alphabet
  jaconv.kana2alphabet('じゃぱん')
  # => 'japan'

  # Alphabet to Hiragana
  jaconv.alphabet2kana('japan')
  # => 'じゃぱん'

  # Katakana to Alphabet
  jaconv.kata2alphabet('ケツイ')
  # => 'ketsui'

  # Alphabet to Katakana
  jaconv.alphabet2kata('namba')
  # => 'ナンバ'

  # Hiragana to Julius's phoneme format
  jaconv.hiragana2julius('てんきすごくいいいいいい')
  # => 't e N k i s u g o k u i:'


NOTE
============

jaconv.normalize method expand unicodedata.normalize for Japanese language processing.

.. code::

    '〜' => 'ー'
    '~' => 'ー'
    "’" => "'"
    '”'=> '"'
    '“' => '``'
    '―' => '-'
    '‐' => '-'
    '˗' => '-'
    '֊' => '-'
    '‐' => '-'
    '‑' => '-'
    '‒' => '-'
    '–' => '-'
    '⁃' => '-'
    '⁻' => '-'
    '₋' => '-'
    '−' => '-'
    '﹣' => 'ー'
    '-' => 'ー'
    '—' => 'ー'
    '―' => 'ー'
    '━' => 'ー'
    '─' => 'ー'




.. |coveralls| image:: https://coveralls.io/repos/ikegami-yukino/jaconv/badge.svg?branch=master&service=github
    :target: https://coveralls.io/github/ikegami-yukino/jaconv?branch=master
    :alt: coveralls.io

.. |pyversion| image:: https://img.shields.io/pypi/pyversions/jaconv.svg

.. |version| image:: https://img.shields.io/pypi/v/jaconv.svg
    :target: http://pypi.python.org/pypi/jaconv/
    :alt: latest version

.. |license| image:: https://img.shields.io/pypi/l/jaconv.svg
    :target: http://pypi.python.org/pypi/jaconv/
    :alt: license

.. |download| image:: https://static.pepy.tech/personalized-badge/neologdn?period=total&units=international_system&left_color=black&right_color=blue&left_text=Downloads
    :target: https://pepy.tech/project/neologdn
    :alt: download

Owner

  • Name: IKEGAMI Yukino
  • Login: ikegami-yukino
  • Kind: user
  • Location: Tokyo, Japan

I am interested in Natural Language Processing (NLP). I develop OSS for the benefit of the public.

GitHub Events

Total
  • Issues event: 1
  • Watch event: 17
  • Issue comment event: 6
  • Push event: 2
  • Pull request review event: 1
  • Pull request event: 6
  • Fork event: 5
Last Year
  • Issues event: 1
  • Watch event: 17
  • Issue comment event: 6
  • Push event: 2
  • Pull request review event: 1
  • Pull request event: 6
  • Fork event: 5

Committers

Last synced: 9 months ago

All Time
  • Total Commits: 108
  • Total Committers: 13
  • Avg Commits per committer: 8.308
  • Development Distribution Score (DDS): 0.241
Past Year
  • Commits: 3
  • Committers: 3
  • Avg Commits per committer: 1.0
  • Development Distribution Score (DDS): 0.667
Top Committers
Name Email Commits
Yukino Ikegami y****m@g****m 82
Ika4 s****m@y****p 8
山田一権 k****a@x****p 3
Sho Shimauchi s****i@g****m 3
Cuddlemuffin007 b****s@g****m 3
Kohki Mametani k****i@g****m 2
furukawaTakumi b****8@g****m 1
Shin Kojima s****n@k****g 1
Le Tuan Anh t****e@g****m 1
Ken Sato k****0@g****m 1
Ian Hunter i****r@g****m 1
Frog 42 f****2@g****m 1
Eli Schwartz e****z@g****g 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 15
  • Total pull requests: 25
  • Average time to close issues: about 2 months
  • Average time to close pull requests: about 1 month
  • Total issue authors: 12
  • Total pull request authors: 14
  • Average comments per issue: 1.4
  • Average comments per pull request: 1.52
  • Merged pull requests: 20
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 2
  • Pull requests: 5
  • Average time to close issues: N/A
  • Average time to close pull requests: 12 days
  • Issue authors: 2
  • Pull request authors: 3
  • Average comments per issue: 0.0
  • Average comments per pull request: 1.4
  • Merged pull requests: 2
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • pekkarr (2)
  • jet082 (2)
  • manjuu-eater (1)
  • keeofkoo (1)
  • ymzlygw (1)
  • axel584 (1)
  • kzthrk (1)
  • fabaff (1)
  • mknz (1)
  • Cuddlemuffin007 (1)
  • nicoyou (1)
  • nemu626 (1)
Pull Request Authors
  • ikegami-yukino (5)
  • eli-schwartz (4)
  • manjuu-eater (3)
  • nemu626 (3)
  • kokimame (2)
  • tabe (2)
  • ianfhunter (2)
  • MikimotoH (1)
  • furukawaTakumi (1)
  • qykth-git (1)
  • Cuddlemuffin007 (1)
  • frog42 (1)
  • letuananh (1)
  • shiumachi (1)
  • ernix (1)
Top Labels
Issue Labels
Pull Request Labels

Packages

  • Total packages: 3
  • Total downloads:
    • pypi 1,610,398 last-month
  • Total docker downloads: 7,131
  • Total dependent packages: 27
    (may contain duplicates)
  • Total dependent repositories: 198
    (may contain duplicates)
  • Total versions: 14
  • Total maintainers: 2
pypi.org: jaconv

Pure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku, Zenkaku and more

  • Versions: 12
  • Dependent Packages: 25
  • Dependent Repositories: 198
  • Downloads: 1,610,398 Last month
  • Docker Downloads: 7,131
Rankings
Downloads: 0.3%
Dependent packages count: 0.6%
Dependent repos count: 1.1%
Docker downloads count: 1.5%
Average: 2.6%
Stargazers count: 3.9%
Forks count: 8.0%
Maintainers (1)
Last synced: 6 months ago
spack.io: py-jaconv

jaconv (Japanese Converter) is interconverter for Hiragana, Katakana, Hankaku (half-width character) and Zenkaku (full-width character)

  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent repos count: 0.0%
Stargazers count: 13.3%
Forks count: 20.3%
Average: 22.7%
Dependent packages count: 57.3%
Maintainers (1)
Last synced: 6 months ago
conda-forge.org: jaconv
  • Versions: 1
  • Dependent Packages: 2
  • Dependent Repositories: 0
Rankings
Dependent packages count: 19.5%
Stargazers count: 22.2%
Average: 27.3%
Forks count: 33.2%
Dependent repos count: 34.0%
Last synced: 6 months ago