https://github.com/doccano/doccano-transformer
The official tool for transforming doccano format into common dataset formats.
Science Score: 13.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
○DOI references
-
○Academic publication links
-
○Committers with academic emails
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (7.7%) to scientific vocabulary
Keywords
Keywords from Contributors
Repository
The official tool for transforming doccano format into common dataset formats.
Basic Info
Statistics
- Stars: 108
- Watchers: 9
- Forks: 35
- Open Issues: 19
- Releases: 1
Topics
Metadata Files
README.md
doccano-transformer
Doccano Transformer helps you to transform an exported dataset into the format of your favorite machine learning library.
Supported formats
Doccano Transformer supports the following formats:
- CoNLL 2003
- spaCy
Install
To install doccano-transformer, simply use pip:
bash
pip install doccano-transformer
Examples
Named Entity Recognition
The following formats are supported:
- CoNLL 2003
- spaCy
```python from doccanotransformer.datasets import NERDataset from doccanotransformer.utils import read_jsonl
dataset = readjsonl(filepath='example.jsonl', dataset=NERDataset, encoding='utf-8') dataset.toconll2003(tokenizer=str.split) dataset.to_spacy(tokenizer=str.split) ```
Contribution
We encourage you to contribute to doccano transformer! Please check out the Contributing to doccano transformer guide for guidelines about how to proceed.
License
Owner
- Name: doccano
- Login: doccano
- Kind: organization
- Repositories: 9
- Profile: https://github.com/doccano
GitHub Events
Total
- Watch event: 2
- Fork event: 2
Last Year
- Watch event: 2
- Fork event: 2
Committers
Last synced: 12 months ago
Top Committers
| Name | Commits | |
|---|---|---|
| yasufumi | y****i@g****m | 59 |
| Hironsan | l****3@g****m | 21 |
| dependabot[bot] | 4****] | 1 |
| The Codacy Badger | b****r@c****m | 1 |
| Marcel | m****n@m****g | 1 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 22
- Total pull requests: 17
- Average time to close issues: 2 months
- Average time to close pull requests: 3 months
- Total issue authors: 17
- Total pull request authors: 8
- Average comments per issue: 2.23
- Average comments per pull request: 0.35
- Merged pull requests: 7
- Bot issues: 0
- Bot pull requests: 6
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- Hironsan (2)
- jdixosnd (2)
- rjuez00 (2)
- harunkuf (2)
- shivaraj1994 (2)
- Akashdesarda (1)
- ayeshah (1)
- cliuxinxin (1)
- gunturbudi (1)
- 7brokenmirrors (1)
- AkimParis (1)
- Aj-232425 (1)
- nk-alex (1)
- gilokip (1)
- fangd123 (1)
Pull Request Authors
- dependabot[bot] (6)
- yasufumy (4)
- prokotg (2)
- Hironsan (1)
- codacy-badger (1)
- henrique-voni (1)
- duranbe (1)
Top Labels
Issue Labels
Pull Request Labels
Dependencies
- autopep8 * develop
- flake8 * develop
- ipython * develop
- isort * develop
- pytest * develop
- pytest-cov * develop
- pytest-datadir * develop
- spacy *
- attrs ==20.3.0 develop
- autopep8 ==1.5.4 develop
- backcall ==0.2.0 develop
- coverage ==5.5 develop
- decorator ==4.4.2 develop
- flake8 ==3.8.4 develop
- iniconfig ==1.1.1 develop
- ipython ==7.19.0 develop
- ipython-genutils ==0.2.0 develop
- isort ==5.6.4 develop
- jedi ==0.18.0 develop
- mccabe ==0.6.1 develop
- packaging ==20.9 develop
- parso ==0.8.1 develop
- pexpect ==4.8.0 develop
- pickleshare ==0.7.5 develop
- pluggy ==0.13.1 develop
- prompt-toolkit ==3.0.17 develop
- ptyprocess ==0.7.0 develop
- py ==1.10.0 develop
- pycodestyle ==2.6.0 develop
- pyflakes ==2.2.0 develop
- pygments ==2.8.1 develop
- pyparsing ==2.4.7 develop
- pytest ==6.1.2 develop
- pytest-cov ==2.10.1 develop
- pytest-datadir ==1.3.1 develop
- toml ==0.10.2 develop
- traitlets ==5.0.5 develop
- wcwidth ==0.2.5 develop
- blis ==0.4.1
- catalogue ==1.0.0
- certifi ==2020.12.5
- chardet ==4.0.0
- cymem ==2.0.5
- idna ==2.10
- murmurhash ==1.0.5
- numpy ==1.20.1
- plac ==1.1.3
- preshed ==3.0.5
- requests ==2.25.1
- spacy ==2.3.2
- srsly ==1.0.5
- thinc ==7.4.1
- tqdm ==4.59.0
- urllib3 ==1.26.3
- wasabi ==0.8.2
- importlib-metadata *
- spacy *
- actions/checkout v1 composite
- actions/setup-python v1 composite
- actions/checkout v2 composite
- github/codeql-action/analyze v1 composite
- github/codeql-action/autobuild v1 composite
- github/codeql-action/init v1 composite
- actions/checkout v2 composite
- actions/setup-python v2 composite
- actions/checkout master composite
- actions/setup-python v2 composite
- pypa/gh-action-pypi-publish master composite