janome
Japanese morphological analysis engine written in pure Python
Science Score: 13.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
○DOI references
-
○Academic publication links
-
○Committers with academic emails
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (8.5%) to scientific vocabulary
Keywords
japanese-language
nlp-library
python
Last synced: 6 months ago
·
JSON representation
Repository
Japanese morphological analysis engine written in pure Python
Basic Info
- Host: GitHub
- Owner: mocobeta
- License: apache-2.0
- Language: Python
- Default Branch: master
- Homepage: https://janome.mocobeta.dev
- Size: 403 MB
Statistics
- Stars: 889
- Watchers: 31
- Forks: 54
- Open Issues: 15
- Releases: 2
Topics
japanese-language
nlp-library
python
Created about 11 years ago
· Last pushed about 1 year ago
Metadata Files
Readme
Changelog
Contributing
License
README.rst
=======
Janome
=======
.. image:: https://github.com/mocobeta/janome/workflows/Checks/badge.svg
:target: https://github.com/mocobeta/janome/actions?query=workflow%3AChecks
.. image:: https://coveralls.io/repos/github/mocobeta/janome/badge.svg?branch=master
:target: https://coveralls.io/github/mocobeta/janome?branch=master
.. image:: https://img.shields.io/pypi/dm/Janome.svg
:target: https://pypistats.org/packages/janome
.. image:: https://img.shields.io/conda/v/conda-forge/janome
:target: https://anaconda.org/conda-forge/janome
Janome is a Japanese morphological analysis engine written in pure Python.
General documentation:
https://janome.mocobeta.dev/en/ (English)
https://janome.mocobeta.dev/ja/ (Japanese)
Requirements
=============
Python 3.7+ is required.
Install
========
[Note] This consumes about 500 MB memory for building.
.. code:: bash
(venv) $ pip install janome
Run
====
.. code:: bash
(venv) $ python
>>> from janome.tokenizer import Tokenizer
>>> t = Tokenizer()
>>> for token in t.tokenize('すもももももももものうち'):
... print(token)
...
すもも 名詞,一般,*,*,*,*,すもも,スモモ,スモモ
も 助詞,係助詞,*,*,*,*,も,モ,モ
もも 名詞,一般,*,*,*,*,もも,モモ,モモ
も 助詞,係助詞,*,*,*,*,も,モ,モ
もも 名詞,一般,*,*,*,*,もも,モモ,モモ
の 助詞,連体化,*,*,*,*,の,ノ,ノ
うち 名詞,非自立,副詞可能,*,*,*,うち,ウチ,ウチ
License
========
Licensed under Apache License 2.0 and uses the MeCab-IPADIC dictionary/statistical model.
See LICENSE.txt and NOTICE.txt for license details.
Acknowledgement
================
Special thanks to @ikawaha, @takuyaa, @nakagami and @janome_oekaki.
Copyright
==========
Copyright(C) 2015-2025, Tomoko Uchida. All rights reserved.
Owner
- Name: Tomoko Uchida
- Login: mocobeta
- Kind: user
- Location: Tokyo
- Website: https://blog.mocobeta.dev/
- Repositories: 21
- Profile: https://github.com/mocobeta
Search Engineer
GitHub Events
Total
- Watch event: 38
- Delete event: 7
- Issue comment event: 5
- Push event: 29
- Gollum event: 1
- Pull request event: 15
- Fork event: 2
- Create event: 8
Last Year
- Watch event: 38
- Delete event: 7
- Issue comment event: 5
- Push event: 29
- Gollum event: 1
- Pull request event: 15
- Fork event: 2
- Create event: 8
Committers
Last synced: 9 months ago
Top Committers
| Name | Commits | |
|---|---|---|
| Tomoko Uchida | t****1@g****m | 403 |
| Hajime Nakagami | n****i@g****m | 7 |
| Hiroyuki Tanaka | h****a@f****p | 6 |
| Tomoko Uchida | t****a@u****m | 3 |
| onishik88 | o****k@i****m | 3 |
| takeshi0406 | s****g@g****m | 2 |
| moco | c****a@g****m | 1 |
| moco | m****o@l****n | 1 |
| Andrii Oriekhov | a****v@g****m | 1 |
| Bastian Zimmermann | 1****m | 1 |
| Masahiro Kamata | 8****9@g****m | 1 |
| Norihito Ishida | 4****a | 1 |
| Shuto Ueki | u****r@g****m | 1 |
| Yasuhisa Yoshida | s****2@g****m | 1 |
| ikawaha | i****a | 1 |
| narupo | n****o | 1 |
| saito400 | k****0@g****m | 1 |
| takahi-i | t****o@g****m | 1 |
| uezo | u****o@u****t | 1 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 44
- Total pull requests: 81
- Average time to close issues: 8 months
- Average time to close pull requests: 24 days
- Total issue authors: 27
- Total pull request authors: 17
- Average comments per issue: 2.07
- Average comments per pull request: 1.26
- Merged pull requests: 72
- Bot issues: 0
- Bot pull requests: 2
Past Year
- Issues: 0
- Pull requests: 16
- Average time to close issues: N/A
- Average time to close pull requests: about 1 hour
- Issue authors: 0
- Pull request authors: 1
- Average comments per issue: 0
- Average comments per pull request: 0.56
- Merged pull requests: 13
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- mocobeta (14)
- nekomimimaiden257 (3)
- kdrkdrkdr (2)
- wimpykid719 (2)
- annisat (1)
- otariidae (1)
- norihitoishida (1)
- nkmk (1)
- SKIYOH (1)
- saito400 (1)
- takanamir (1)
- mmcauliffe (1)
- quaizarv (1)
- HeroadZ (1)
- d371 (1)
Pull Request Authors
- mocobeta (59)
- nakagami (4)
- dependabot[bot] (2)
- ghost (2)
- andriyor (1)
- syou6162 (1)
- takeshi0406 (1)
- takahi-i (1)
- norihitoishida (1)
- roy-ht (1)
- BastianZim (1)
- sueki1242 (1)
- onishik88 (1)
- saito400 (1)
- kamatari (1)
Top Labels
Issue Labels
enhancement (5)
wontfix (2)
bug (2)
build (1)
help wanted (1)
good first issue (1)
Pull Request Labels
enhancement (2)
dependencies (2)
Packages
- Total packages: 2
-
Total downloads:
- pypi 187,345 last-month
- Total docker downloads: 7
-
Total dependent packages: 17
(may contain duplicates) -
Total dependent repositories: 137
(may contain duplicates) - Total versions: 28
- Total maintainers: 1
pypi.org: janome
Japanese morphological analysis engine.
- Homepage: https://mocobeta.github.io/janome/en/
- Documentation: https://janome.readthedocs.io/
- License: AL2
-
Latest release: 0.5.0
published over 2 years ago
Rankings
Downloads: 0.8%
Dependent packages count: 0.8%
Dependent repos count: 1.3%
Stargazers count: 2.2%
Average: 2.5%
Docker downloads count: 4.0%
Forks count: 5.9%
Maintainers (1)
Last synced:
6 months ago
conda-forge.org: janome
- Homepage: https://github.com/mocobeta/janome
- License: Apache-2.0
-
Latest release: 0.4.2
published about 4 years ago
Rankings
Stargazers count: 13.4%
Forks count: 22.9%
Average: 24.8%
Dependent packages count: 28.8%
Dependent repos count: 34.0%
Last synced:
6 months ago
Dependencies
docs/requirements-docs.txt
pypi
- Sphinx ==4.4.0
- sphinxcontrib-github-ribbon ==0.9.0
requirements-dev.txt
pypi
- autopep8 ==1.5.4 development
- flake8 ==3.8.3 development
- mccabe ==0.6.1 development
- mypy ==0.782 development
- mypy-extensions ==0.4.3 development
- psutil ==5.9.0 development
- pycodestyle ==2.6.0 development
- pyflakes ==2.2.0 development
- toml ==0.10.1 development
- twine ==3.2.0 development
- typed-ast ==1.4.1 development
- typing-extensions ==3.7.4.2 development
- wheel ==0.35.1 development
.github/workflows/checks.yml
actions
- AndreMiras/coveralls-python-action develop composite
- actions/checkout v2 composite
- actions/setup-python v2 composite
setup.py
pypi