txt-utils
CLI to batch process lines of a single text file.
Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ✓ CITATION.cff file: Found CITATION.cff file
- ✓ codemeta.json file: Found codemeta.json file
- ✓ .zenodo.json file: Found .zenodo.json file
- ✓ DOI references: Found 5 DOI reference(s) in README
- ✓ Academic publication links: Links to zenodo.org
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: Low similarity (9.8%) to scientific vocabulary
Repository
CLI to batch process lines of a single text file.
Basic Info
- Host: GitHub
- Owner: stefantaubert
- License: mit
- Language: Python
- Default Branch: master
- Size: 212 KB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 3
Metadata Files
README.md
txt-utils
CLI to modify text files.
Features
- merge: merge multiple text files into one
- extract-vocabulary: extract unit vocabulary
- transcribe: transcribe units
- replace: replace text
- replace-line: replace text in a line
- trim-units: trim units
- remove-units: remove units
- create-unit-occurrence-stats: create unit occurrence statistics
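To make the notion of "units" more concrete, here is a minimal sketch of what vocabulary extraction and unit occurrence statistics could look like in plain Python. It is not the txt-utils implementation or its API: the function names `split_units`, `extract_vocabulary`, and `unit_occurrences`, and the choice of a single space as the default unit separator, are assumptions made only for illustration.

```python
from collections import Counter
from typing import Iterable, List


def split_units(line: str, sep: str = " ") -> List[str]:
    """Split one line into units, dropping empty units from repeated separators."""
    return [unit for unit in line.split(sep) if unit]


def extract_vocabulary(lines: Iterable[str], sep: str = " ") -> List[str]:
    """Return the sorted list of distinct units found across all lines."""
    vocabulary = set()
    for line in lines:
        vocabulary.update(split_units(line, sep))
    return sorted(vocabulary)


def unit_occurrences(lines: Iterable[str], sep: str = " ") -> Counter:
    """Count how often each unit occurs across all lines."""
    counts: Counter = Counter()
    for line in lines:
        counts.update(split_units(line, sep))
    return counts


if __name__ == "__main__":
    # Hypothetical input; in practice the lines would come from a text file.
    example_lines = ["the cat sat", "the dog  sat down"]
    print(extract_vocabulary(example_lines))
    print(unit_occurrences(example_lines))
```

Treating a line as a sequence of separator-delimited units keeps the same helpers usable for words, characters, or phonetic symbols, depending on the separator passed in.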
Roadmap
- create n-grams
- map units
- merge units right/left
- calculate units TF-IDF
Installation
pip install txt-utils --user
Usage
txt-utils-cli
Citation
If you want to cite this repo, you can use the BibTeX entry generated by GitHub (see About => Cite this repository).
Taubert, S. (2024). txt-utils (Version 0.0.3) [Computer software]. https://doi.org/10.5281/zenodo.10571273
Acknowledgments
Funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – Project-ID 416228727 – CRC 1410
Owner
- Name: Stefan Taubert
- Login: stefantaubert
- Kind: user
- Location: Chemnitz, Germany
- Company: Chemnitz University of Technology
- Website: https://stefantaubert.com
- Twitter: Stefan_Taubert
- Repositories: 75
- Profile: https://github.com/stefantaubert
I am currently working on my PhD on speech synthesis at Chemnitz University of Technology.
Citation (CITATION.cff)
cff-version: 1.2.0
title: txt-utils
abstract: CLI to modify text files.
message: >-
  If you use this software, please cite it using the
  metadata from this file.
type: software
authors:
  - email: github@stefantaubert.com
    given-names: Stefan
    family-names: Taubert
    affiliation: Chemnitz University of Technology
    orcid: 'https://orcid.org/0000-0002-4932-2874'
    website: 'https://stefantaubert.com/'
version: 0.0.3
date-released: 2024-01-26
license: MIT
url: https://github.com/stefantaubert/txt-utils
doi: 10.5281/zenodo.10571273
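As an illustration of how the CITATION.cff fields map onto the plain-text citation shown in the README, the following sketch builds that string from a dictionary holding the same values. The helper name `format_citation` and the use of a hand-written dictionary (rather than parsing the YAML file) are assumptions for this example; the output format simply mirrors the citation line quoted above.

```python
def format_citation(meta: dict) -> str:
    """Build an APA-like software citation from CFF-style metadata."""
    author = meta["authors"][0]
    initial = author["given-names"][0]  # first letter of the given name
    year = meta["date-released"][:4]    # ISO date, so the year comes first
    return (
        f'{author["family-names"]}, {initial}. ({year}). {meta["title"]} '
        f'(Version {meta["version"]}) [Computer software]. '
        f'https://doi.org/{meta["doi"]}'
    )


# Values copied from the CITATION.cff content above.
metadata = {
    "title": "txt-utils",
    "version": "0.0.3",
    "date-released": "2024-01-26",
    "doi": "10.5281/zenodo.10571273",
    "authors": [{"given-names": "Stefan", "family-names": "Taubert"}],
}

if __name__ == "__main__":
    print(format_citation(metadata))
```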
Issues and Pull Requests
Last synced: 8 months ago
All Time
- Total issues: 0
- Total pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 0
- Total pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Packages
- Total packages: 1
- Total downloads (pypi): 22 last month
- Total dependent packages: 2
- Total dependent repositories: 2
- Total versions: 3
- Total maintainers: 1
pypi.org: txt-utils
CLI to modify text files.
- Homepage: https://github.com/stefantaubert/txt-utils
- Documentation: https://txt-utils.readthedocs.io/
- License: MIT
- Latest release: 0.0.3 (published about 2 years ago)
Dependencies
- autoflake * develop
- autopep8 * develop
- isort * develop
- pycodestyle * develop
- pylint * develop
- pytest * develop
- rope * develop
- twine * develop
- txt-utils * develop
- ordered-set >=4.1.0
- pandas *
- pronunciation-dictionary >=0.0.4
- tqdm *
- astroid ==2.11.5 develop
- attrs ==21.4.0 develop
- autoflake ==1.4 develop
- autopep8 ==1.6.0 develop
- bleach ==5.0.0 develop
- certifi ==2022.5.18.1 develop
- cffi ==1.15.0 develop
- charset-normalizer ==2.0.12 develop
- commonmark ==0.9.1 develop
- cryptography ==37.0.2 develop
- dill ==0.3.5.1 develop
- docutils ==0.18.1 develop
- idna ==3.3 develop
- importlib-metadata ==4.11.4 develop
- iniconfig ==1.1.1 develop
- isort ==5.10.1 develop
- jeepney ==0.8.0 develop
- keyring ==23.5.1 develop
- lazy-object-proxy ==1.7.1 develop
- mccabe ==0.7.0 develop
- ordered-set ==4.1.0 develop
- packaging ==21.3 develop
- pkginfo ==1.8.2 develop
- platformdirs ==2.5.2 develop
- pluggy ==1.0.0 develop
- pronunciation-dictionary ==0.0.4 develop
- py ==1.11.0 develop
- pycodestyle ==2.8.0 develop
- pycparser ==2.21 develop
- pyflakes ==2.4.0 develop
- pygments ==2.12.0 develop
- pylint ==2.13.9 develop
- pyparsing ==3.0.9 develop
- pytest ==7.1.2 develop
- readme-renderer ==35.0 develop
- requests ==2.27.1 develop
- requests-toolbelt ==0.9.1 develop
- rfc3986 ==2.0.0 develop
- rich ==12.4.4 develop
- rope ==1.0.0 develop
- secretstorage ==3.3.2 develop
- setuptools ==62.3.2 develop
- six ==1.16.0 develop
- toml ==0.10.2 develop
- tomli ==2.0.1 develop
- tqdm ==4.64.0 develop
- twine ==4.0.0 develop
- txt-utils * develop
- typing-extensions ==4.2.0 develop
- urllib3 ==1.26.9 develop
- webencodings ==0.5.1 develop
- wrapt ==1.14.1 develop
- zipp ==3.8.0 develop
- iterable-serialization ==0.0.1
- numpy ==1.22.4
- ordered-set ==4.1.0
- pandas ==1.4.2
- pronunciation-dictionary ==0.0.4
- python-dateutil ==2.8.2
- pytz ==2022.1
- six ==1.16.0
- tqdm ==4.64.0
- ordered_set >=4.1.0
- pandas *
- pronunciation_dictionary >=0.0.4
- tqdm *