janome

Japanese morphological analysis engine written in pure Python

https://github.com/mocobeta/janome

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (8.5%) to scientific vocabulary

Keywords

japanese-language nlp-library python
Last synced: 6 months ago · JSON representation

Repository

Japanese morphological analysis engine written in pure Python

Basic Info
  • Host: GitHub
  • Owner: mocobeta
  • License: apache-2.0
  • Language: Python
  • Default Branch: master
  • Homepage: https://janome.mocobeta.dev
  • Size: 403 MB
Statistics
  • Stars: 889
  • Watchers: 31
  • Forks: 54
  • Open Issues: 15
  • Releases: 2
Topics
japanese-language nlp-library python
Created about 11 years ago · Last pushed about 1 year ago
Metadata Files
Readme Changelog Contributing License

README.rst

=======
Janome
=======

.. image:: https://github.com/mocobeta/janome/workflows/Checks/badge.svg
    :target: https://github.com/mocobeta/janome/actions?query=workflow%3AChecks

.. image:: https://coveralls.io/repos/github/mocobeta/janome/badge.svg?branch=master
    :target: https://coveralls.io/github/mocobeta/janome?branch=master

.. image:: https://img.shields.io/pypi/dm/Janome.svg
    :target: https://pypistats.org/packages/janome

.. image:: https://img.shields.io/conda/v/conda-forge/janome
    :target: https://anaconda.org/conda-forge/janome

Janome is a Japanese morphological analysis engine written in pure Python.

General documentation:

https://janome.mocobeta.dev/en/ (English)

https://janome.mocobeta.dev/ja/ (Japanese)

Requirements
=============

Python 3.7+ is required.

Install
========

[Note] This consumes about 500 MB memory for building.

.. code:: bash

  (venv) $ pip install janome

Run
====

.. code:: bash

  (venv) $ python
  >>> from janome.tokenizer import Tokenizer
  >>> t = Tokenizer()
  >>> for token in t.tokenize('すもももももももものうち'):
  ...     print(token)
  ...
  すもも 名詞,一般,*,*,*,*,すもも,スモモ,スモモ
  も    助詞,係助詞,*,*,*,*,も,モ,モ
  もも  名詞,一般,*,*,*,*,もも,モモ,モモ
  も    助詞,係助詞,*,*,*,*,も,モ,モ
  もも  名詞,一般,*,*,*,*,もも,モモ,モモ
  の    助詞,連体化,*,*,*,*,の,ノ,ノ
  うち  名詞,非自立,副詞可能,*,*,*,うち,ウチ,ウチ

License
========

Licensed under Apache License 2.0 and uses the MeCab-IPADIC dictionary/statistical model.

See LICENSE.txt and NOTICE.txt for license details.

Acknowledgement
================

Special thanks to @ikawaha, @takuyaa, @nakagami and @janome_oekaki.

Copyright
==========

Copyright(C) 2015-2025, Tomoko Uchida. All rights reserved.

Owner

  • Name: Tomoko Uchida
  • Login: mocobeta
  • Kind: user
  • Location: Tokyo

Search Engineer

GitHub Events

Total
  • Watch event: 38
  • Delete event: 7
  • Issue comment event: 5
  • Push event: 29
  • Gollum event: 1
  • Pull request event: 15
  • Fork event: 2
  • Create event: 8
Last Year
  • Watch event: 38
  • Delete event: 7
  • Issue comment event: 5
  • Push event: 29
  • Gollum event: 1
  • Pull request event: 15
  • Fork event: 2
  • Create event: 8

Committers

Last synced: 9 months ago

All Time
  • Total Commits: 437
  • Total Committers: 19
  • Avg Commits per committer: 23.0
  • Development Distribution Score (DDS): 0.078
Past Year
  • Commits: 10
  • Committers: 1
  • Avg Commits per committer: 10.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Tomoko Uchida t****1@g****m 403
Hajime Nakagami n****i@g****m 7
Hiroyuki Tanaka h****a@f****p 6
Tomoko Uchida t****a@u****m 3
onishik88 o****k@i****m 3
takeshi0406 s****g@g****m 2
moco c****a@g****m 1
moco m****o@l****n 1
Andrii Oriekhov a****v@g****m 1
Bastian Zimmermann 1****m 1
Masahiro Kamata 8****9@g****m 1
Norihito Ishida 4****a 1
Shuto Ueki u****r@g****m 1
Yasuhisa Yoshida s****2@g****m 1
ikawaha i****a 1
narupo n****o 1
saito400 k****0@g****m 1
takahi-i t****o@g****m 1
uezo u****o@u****t 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 44
  • Total pull requests: 81
  • Average time to close issues: 8 months
  • Average time to close pull requests: 24 days
  • Total issue authors: 27
  • Total pull request authors: 17
  • Average comments per issue: 2.07
  • Average comments per pull request: 1.26
  • Merged pull requests: 72
  • Bot issues: 0
  • Bot pull requests: 2
Past Year
  • Issues: 0
  • Pull requests: 16
  • Average time to close issues: N/A
  • Average time to close pull requests: about 1 hour
  • Issue authors: 0
  • Pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.56
  • Merged pull requests: 13
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • mocobeta (14)
  • nekomimimaiden257 (3)
  • kdrkdrkdr (2)
  • wimpykid719 (2)
  • annisat (1)
  • otariidae (1)
  • norihitoishida (1)
  • nkmk (1)
  • SKIYOH (1)
  • saito400 (1)
  • takanamir (1)
  • mmcauliffe (1)
  • quaizarv (1)
  • HeroadZ (1)
  • d371 (1)
Pull Request Authors
  • mocobeta (59)
  • nakagami (4)
  • dependabot[bot] (2)
  • ghost (2)
  • andriyor (1)
  • syou6162 (1)
  • takeshi0406 (1)
  • takahi-i (1)
  • norihitoishida (1)
  • roy-ht (1)
  • BastianZim (1)
  • sueki1242 (1)
  • onishik88 (1)
  • saito400 (1)
  • kamatari (1)
Top Labels
Issue Labels
enhancement (5) wontfix (2) bug (2) build (1) help wanted (1) good first issue (1)
Pull Request Labels
enhancement (2) dependencies (2)

Packages

  • Total packages: 2
  • Total downloads:
    • pypi 187,345 last-month
  • Total docker downloads: 7
  • Total dependent packages: 17
    (may contain duplicates)
  • Total dependent repositories: 137
    (may contain duplicates)
  • Total versions: 28
  • Total maintainers: 1
pypi.org: janome

Japanese morphological analysis engine.

  • Versions: 26
  • Dependent Packages: 16
  • Dependent Repositories: 137
  • Downloads: 187,345 Last month
  • Docker Downloads: 7
Rankings
Downloads: 0.8%
Dependent packages count: 0.8%
Dependent repos count: 1.3%
Stargazers count: 2.2%
Average: 2.5%
Docker downloads count: 4.0%
Forks count: 5.9%
Maintainers (1)
Last synced: 6 months ago
conda-forge.org: janome
  • Versions: 2
  • Dependent Packages: 1
  • Dependent Repositories: 0
Rankings
Stargazers count: 13.4%
Forks count: 22.9%
Average: 24.8%
Dependent packages count: 28.8%
Dependent repos count: 34.0%
Last synced: 6 months ago

Dependencies

docs/requirements-docs.txt pypi
  • Sphinx ==4.4.0
  • sphinxcontrib-github-ribbon ==0.9.0
requirements-dev.txt pypi
  • autopep8 ==1.5.4 development
  • flake8 ==3.8.3 development
  • mccabe ==0.6.1 development
  • mypy ==0.782 development
  • mypy-extensions ==0.4.3 development
  • psutil ==5.9.0 development
  • pycodestyle ==2.6.0 development
  • pyflakes ==2.2.0 development
  • toml ==0.10.1 development
  • twine ==3.2.0 development
  • typed-ast ==1.4.1 development
  • typing-extensions ==3.7.4.2 development
  • wheel ==0.35.1 development
.github/workflows/checks.yml actions
  • AndreMiras/coveralls-python-action develop composite
  • actions/checkout v2 composite
  • actions/setup-python v2 composite
setup.py pypi