Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (6.1%) to scientific vocabulary

Scientific Fields

Artificial Intelligence and Machine Learning Computer Science - 71% confidence
Last synced: 4 months ago · JSON representation ·

Repository

Basic Info
  • Host: GitHub
  • Owner: KyleChen2002
  • License: apache-2.0
  • Language: Python
  • Default Branch: master
  • Size: 9.16 MB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created over 1 year ago · Last pushed over 1 year ago
Metadata Files
Readme Contributing License Citation

README.md

Chinese Text Error Correction

This is a Chinese Text Error Correction software based on the Transformer-based MacBERT model. The software provides functionalities for correcting errors in text, Word documents, and PDF files. Additionally, it can automatically expand training samples using a provided dataset to generate a customized model. The software also supports model switching, allowing users to select different correction models as needed.

Features

  • Text Correction: Detects and corrects errors in Chinese text.
  • Word & PDF Correction: Corrects errors in Word and PDF files.
  • Training Sample Expansion: Automatically expands training samples based on a given dataset, enabling the generation of a custom correction model.
  • Model Switching: Allows users to switch between different correction models for flexible error correction.

Demonstration Video

Click here to watch the demo video


Feel free to explore and contribute to improve this software!

Owner

  • Login: KyleChen2002
  • Kind: user

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Xu"
  given-names: "Ming"
  orcid: "https://orcid.org/0000-0003-3402-7159"
title: "Pycorrector: Text error correction tool"
url: "https://github.com/shibing624/pycorrector"
data-released: 2021-12-03
version: 0.4.2

GitHub Events

Total
  • Watch event: 1
Last Year
  • Watch event: 1

Dependencies

Dockerfile docker
  • centos 7 build
requirements-dev.txt pypi
  • jieba * development
  • loguru * development
  • numpy * development
  • paddlenlp * development
  • paddlepaddle-gpu * development
  • pandas * development
  • pypinyin * development
  • pytorch-lightning >=1.1.2 development
  • scikit-learn >=0.19.1 development
  • six * development
  • tensorboardX * development
  • torch >=1.3.1 development
  • transformers >=4.4.2 development
requirements.txt pypi
  • jieba *
  • loguru *
  • numpy *
  • pypinyin *
  • six *
setup.py pypi
  • jieba *
  • loguru *
  • numpy *
  • pypinyin *
  • six *