txt-utils

CLI to batch process lines of a single text file.

https://github.com/stefantaubert/txt-utils

Science Score: 67.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 5 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.8%) to scientific vocabulary
Last synced: 6 months ago

Repository

Basic Info
  • Host: GitHub
  • Owner: stefantaubert
  • License: MIT
  • Language: Python
  • Default Branch: master
  • Size: 212 KB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 3
Created almost 4 years ago · Last pushed about 2 years ago
Metadata Files
Readme, Changelog, Contributing, License, Code of conduct, Citation

README.md

txt-utils

CLI to modify text files.

Features

  • merge: merge multiple text files into one
  • extract-vocabulary: extract unit vocabulary
  • transcribe: transcribe units
  • replace: replace text
  • replace-line: replace text in a line
  • trim-units: trim units
  • remove-units: remove units
  • create-unit-occurrence-stats: create unit occurrence statistics
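
The README does not define "unit". The sketch below assumes a unit is a separator-delimited token within a line, and illustrates what extracting a vocabulary and counting unit occurrences could look like under that assumption. The function names `split_units`, `extract_vocabulary`, and `unit_occurrences` are hypothetical and are not part of the txt-utils API.

```python
from collections import Counter
from typing import Iterable, List


def split_units(line: str, sep: str = " ") -> List[str]:
    """Split one line into units on `sep`, dropping empty strings."""
    return [unit for unit in line.split(sep) if unit]


def extract_vocabulary(lines: Iterable[str], sep: str = " ") -> List[str]:
    """Return the distinct units across all lines, sorted alphabetically."""
    return sorted({unit for line in lines for unit in split_units(line, sep)})


def unit_occurrences(lines: Iterable[str], sep: str = " ") -> Counter:
    """Count how often each unit occurs across all lines."""
    return Counter(unit for line in lines for unit in split_units(line, sep))


# Hypothetical input: each string stands for one line of a text file.
lines = ["the cat sat", "the dog sat down"]
vocabulary = extract_vocabulary(lines)
counts = unit_occurrences(lines)
print(vocabulary)
print(counts.most_common(2))
```

Passing a different `sep` (for example `"|"`) would treat other delimiters as unit boundaries. Whether the actual CLI exposes such an option is not stated here.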

Roadmap

  • create n-grams
  • map units
  • merge units right/left
  • calculate units TF-IDF
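
For orientation, the n-gram roadmap item could be read as sliding a window of length n over the units of a line. The following is a minimal sketch under that reading; the function name `unit_ngrams` is hypothetical and does not reflect any planned implementation in txt-utils.

```python
from typing import List, Sequence, Tuple


def unit_ngrams(units: Sequence[str], n: int) -> List[Tuple[str, ...]]:
    """Return all contiguous n-grams of `units` in order of appearance."""
    if n < 1:
        raise ValueError("n must be at least 1")
    # A sequence shorter than n yields an empty range and thus no n-grams.
    return [tuple(units[i:i + n]) for i in range(len(units) - n + 1)]


bigrams = unit_ngrams(["a", "b", "c", "d"], 2)
print(bigrams)
```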

Installation

    pip install txt-utils --user

Usage

    txt-utils-cli

Citation

To cite this repository, use the BibTeX entry generated by GitHub (see About => Cite this repository).

Taubert, S. (2024). txt-utils (Version 0.0.3) [Computer software]. https://doi.org/10.5281/zenodo.10571273

Acknowledgments

Funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – Project-ID 416228727 – CRC 1410

Owner

  • Name: Stefan Taubert
  • Login: stefantaubert
  • Kind: user
  • Location: Chemnitz, Germany
  • Company: Chemnitz University of Technology

I am currently working on my PhD on speech synthesis at Chemnitz University of Technology.

Citation (CITATION.cff)

cff-version: 1.2.0
title: txt-utils
abstract: CLI to modify text files.
message: >-
  If you use this software, please cite it using the
  metadata from this file.
type: software
authors:
  - email: github@stefantaubert.com
    given-names: Stefan
    family-names: Taubert
    affiliation: Chemnitz University of Technology
    orcid: 'https://orcid.org/0000-0002-4932-2874'
    website: 'https://stefantaubert.com/'
version: 0.0.3
date-released: 2024-01-26
license: MIT
url: https://github.com/stefantaubert/txt-utils
doi: 10.5281/zenodo.10571273

Issues and Pull Requests

Last synced: 8 months ago

All Time
  • Total issues: 0
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 0
  • Total pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0

Packages

  • Total packages: 1
  • Total downloads:
    • PyPI: 22 last month
  • Total dependent packages: 2
  • Total dependent repositories: 2
  • Total versions: 3
  • Total maintainers: 1
pypi.org: txt-utils

CLI to modify text files.

  • Versions: 3
  • Dependent Packages: 2
  • Dependent Repositories: 2
  • Downloads: 22 last month
Rankings
  • Dependent packages count: 10.0%
  • Dependent repos count: 11.6%
  • Average: 28.7%
  • Forks count: 29.8%
  • Stargazers count: 38.8%
  • Downloads: 53.2%
Maintainers (1)
Last synced: 7 months ago

Dependencies

Pipfile pypi
  • autoflake * develop
  • autopep8 * develop
  • isort * develop
  • pycodestyle * develop
  • pylint * develop
  • pytest * develop
  • rope * develop
  • twine * develop
  • txt-utils * develop
  • ordered-set >=4.1.0
  • pandas *
  • pronunciation-dictionary >=0.0.4
  • tqdm *
Pipfile.lock pypi
  • astroid ==2.11.5 develop
  • attrs ==21.4.0 develop
  • autoflake ==1.4 develop
  • autopep8 ==1.6.0 develop
  • bleach ==5.0.0 develop
  • certifi ==2022.5.18.1 develop
  • cffi ==1.15.0 develop
  • charset-normalizer ==2.0.12 develop
  • commonmark ==0.9.1 develop
  • cryptography ==37.0.2 develop
  • dill ==0.3.5.1 develop
  • docutils ==0.18.1 develop
  • idna ==3.3 develop
  • importlib-metadata ==4.11.4 develop
  • iniconfig ==1.1.1 develop
  • isort ==5.10.1 develop
  • jeepney ==0.8.0 develop
  • keyring ==23.5.1 develop
  • lazy-object-proxy ==1.7.1 develop
  • mccabe ==0.7.0 develop
  • ordered-set ==4.1.0 develop
  • packaging ==21.3 develop
  • pkginfo ==1.8.2 develop
  • platformdirs ==2.5.2 develop
  • pluggy ==1.0.0 develop
  • pronunciation-dictionary ==0.0.4 develop
  • py ==1.11.0 develop
  • pycodestyle ==2.8.0 develop
  • pycparser ==2.21 develop
  • pyflakes ==2.4.0 develop
  • pygments ==2.12.0 develop
  • pylint ==2.13.9 develop
  • pyparsing ==3.0.9 develop
  • pytest ==7.1.2 develop
  • readme-renderer ==35.0 develop
  • requests ==2.27.1 develop
  • requests-toolbelt ==0.9.1 develop
  • rfc3986 ==2.0.0 develop
  • rich ==12.4.4 develop
  • rope ==1.0.0 develop
  • secretstorage ==3.3.2 develop
  • setuptools ==62.3.2 develop
  • six ==1.16.0 develop
  • toml ==0.10.2 develop
  • tomli ==2.0.1 develop
  • tqdm ==4.64.0 develop
  • twine ==4.0.0 develop
  • txt-utils * develop
  • typing-extensions ==4.2.0 develop
  • urllib3 ==1.26.9 develop
  • webencodings ==0.5.1 develop
  • wrapt ==1.14.1 develop
  • zipp ==3.8.0 develop
  • iterable-serialization ==0.0.1
  • numpy ==1.22.4
  • ordered-set ==4.1.0
  • pandas ==1.4.2
  • pronunciation-dictionary ==0.0.4
  • python-dateutil ==2.8.2
  • pytz ==2022.1
  • six ==1.16.0
  • tqdm ==4.64.0
pyproject.toml pypi
  • ordered_set >=4.1.0
  • pandas *
  • pronunciation_dictionary >=0.0.4
  • tqdm *