pyhelpers

PyHelpers: An open-source toolkit for facilitating Python users' data manipulation tasks

https://github.com/mikeqfu/pyhelpers

Science Score: 67.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 6 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (16.4%) to scientific vocabulary

Keywords

data-manipulation data-preprocessing py-utils python python-utilities python-utility python-utils utilities
Last synced: 6 months ago · JSON representation ·

Repository

PyHelpers: An open-source toolkit for facilitating Python users' data manipulation tasks

Basic Info
Statistics
  • Stars: 12
  • Watchers: 3
  • Forks: 3
  • Open Issues: 1
  • Releases: 46
Topics
data-manipulation data-preprocessing py-utils python python-utilities python-utility python-utils utilities
Created almost 7 years ago · Last pushed 8 months ago
Metadata Files
Readme Changelog License Citation

README.md

PyHelpers

PyPI Release Version Python Version License ReadTheDocs Documentation GitHub Actions Workflow Status Codacy - Code Quality DOI

PyHelpers is an open-source Python package designed to streamline data (pre-)processing and manipulation tasks. It accommodates a wide range of functions and classes grounded in practical applications, making common data operations more accessible and efficient. This toolkit is particularly useful for Python learners, researchers and data scientists seeking to enhance their workflows.

The package supports handling various data types, such as geographical and textual data, allowing for flexibility for diverse data processing needs. It also simplifies data input and output operations by offering functionalities for managing many different file-like objects. In addition, PyHelpers facilitates communication with relational databases, such as PostgreSQL and Microsoft SQL Server. This capability greatly smooths data integration with database systems through efficient data storage and retrieval.

With its comprehensive suite of practical tools, PyHelpers simplifies complex data processing tasks and boosts productivity. It is ready to serve as an essential resource for effective data manipulation, management and analysis for anyone working with data in Python.

Installation

To install the latest version of PyHelpers from PyPI using pip:

bash $ pip install --upgrade pyhelpers

For more information, see the Installation.

Quick Start

For a concise guide on how to use PyHelpers, check out the Quick Start tutorial, which includes illustrative examples for each of the Modules.

These examples briefly demonstrate the capabilities of PyHelpers in facilitating data manipulation tasks and streamlining work processes.

Documentation

The complete PyHelpers Documentation is available in HTML and PDF formats.

It is hosted on Read the Docs, and the HTML version is also accessible via GitHub Pages. The documentation includes detailed examples, tutorials and comprehensive references to help users get the most out of PyHelpers.

Cite as

Fu, Q. (2020). PyHelpers: An open-source toolkit for facilitating Python users' data manipulation tasks. Zenodo. doi:10.5281/zenodo.4017438.

bibtex @software{Fu_PyHelpers_2020, author = {Fu, Qian}, title = {{PyHelpers: An open-source toolkit for facilitating Python users' data manipulation tasks}}, year = 2020, publisher = {Zenodo}, doi = {10.5281/zenodo.4017438}, license = {MIT}, url = {https://github.com/mikeqfu/pyhelpers} }

For specific version references, please refer to Zenodo.

License

PyHelpers is licensed under the MIT License.

Please note that this project was initially licensed under the GPLv3+ up to version 1.5.2. Starting with version 2.0.0, it has been re-licensed under the MIT License.

Owner

  • Name: Qian Fu
  • Login: mikeqfu
  • Kind: user
  • Company: University of Birmingham

Research Fellow @UoB-BCRRE

Citation (CITATION.cff)

cff-version: 1.2.0
title: >-
  PyHelpers: An open-source toolkit for facilitating Python
  users' data manipulation tasks
message: >-
  If you use PyHelpers and/or any code from its GitHub
  repository, please cite it using the metadata provided in
  this file. For specific version references of PyHelpers,
  please refer to Zenodo
  (https://zenodo.org/search?q=conceptrecid%3A%224017438%22&f=allversions%3Atrue&l=list&p=1&s=10&sort=version).
type: software
authors:
  - given-names: Qian
    family-names: Fu
    email: q.fu@bham.ac.uk
    affiliation: University of Birmingham
    orcid: 'https://orcid.org/0000-0002-6502-9934'
identifiers:
  - type: doi
    value: 10.5281/zenodo.4017438
    description: >-
      This DOI represents all versions of PyHelpers, and will
      always resolve to the latest one.
repository-code: 'https://github.com/mikeqfu/pyhelpers'
url: 'https://mikeqfu.github.io/pyhelpers/'
repository: 'https://pyhelpers.readthedocs.io'
repository-artifact: 'https://pypi.org/project/pyhelpers/'
abstract: >-
  PyHelpers is an open-source Python package designed to
  streamline data (pre-)processing and manipulation tasks.
  It accommodates a wide range of functions and classes
  grounded in practical applications, making common data
  operations more accessible and efficient. This toolkit is
  particularly useful for Python learners, researchers and
  data scientists seeking to enhance their workflows.

  The package supports handling various data types, such as
  geographical and textual data, allowing for flexibility
  for diverse data processing needs. It also simplifies data
  input and output operations by offering functionalities
  for managing many different file-like objects. In
  addition, PyHelpers facilitates communication with
  relational databases, such as PostgreSQL and Microsoft SQL
  Server. This capability greatly smooths data integration
  with database systems through efficient data storage and
  retrieval.

  With its comprehensive suite of practical tools, PyHelpers
  simplifies complex data processing tasks and boosts
  productivity. It is ready to serve as an essential
  resource for effective data manipulation, management and
  analysis for anyone working with data in Python.
keywords:
  - Python
  - Utilities
  - Data preprocessing
  - Data manipulation
  - Python utilities
  - Python utils
  - Python utility
license: MIT
date-released: '2020-09-06'

GitHub Events

Total
  • Create event: 37
  • Issues event: 4
  • Release event: 2
  • Watch event: 4
  • Delete event: 31
  • Push event: 75
  • Pull request event: 54
  • Fork event: 2
Last Year
  • Create event: 37
  • Issues event: 4
  • Release event: 2
  • Watch event: 4
  • Delete event: 31
  • Push event: 75
  • Pull request event: 54
  • Fork event: 2

Committers

Last synced: almost 3 years ago

All Time
  • Total Commits: 1,087
  • Total Committers: 3
  • Avg Commits per committer: 362.333
  • Development Distribution Score (DDS): 0.006
Top Committers
Name Email Commits
mikeqfu q****u@o****m 1,081
Qian Fu m****u@u****m 5
mikeqfu 2****v 1

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 19
  • Total pull requests: 92
  • Average time to close issues: 1 day
  • Average time to close pull requests: 5 days
  • Total issue authors: 1
  • Total pull request authors: 4
  • Average comments per issue: 0.05
  • Average comments per pull request: 0.16
  • Merged pull requests: 71
  • Bot issues: 0
  • Bot pull requests: 11
Past Year
  • Issues: 4
  • Pull requests: 61
  • Average time to close issues: about 3 hours
  • Average time to close pull requests: about 1 hour
  • Issue authors: 1
  • Pull request authors: 1
  • Average comments per issue: 0.0
  • Average comments per pull request: 0.0
  • Merged pull requests: 55
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • mikeqfu (19)
Pull Request Authors
  • mikeqfu (84)
  • dependabot[bot] (11)
  • codacy-badger (4)
  • xyluo25 (3)
Top Labels
Issue Labels
enhancement (8) bug (7) good first issue (6) invalid (2) documentation (2) help wanted (1)
Pull Request Labels
enhancement (57) bug (39) documentation (39) dependencies (12) tests (5) good first issue (1) invalid (1)

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 1,193 last-month
  • Total docker downloads: 162
  • Total dependent packages: 3
  • Total dependent repositories: 4
  • Total versions: 48
  • Total maintainers: 1
pypi.org: pyhelpers

An open-source toolkit for facilitating Python users' data manipulation tasks

  • Versions: 48
  • Dependent Packages: 3
  • Dependent Repositories: 4
  • Downloads: 1,193 Last month
  • Docker Downloads: 162
Rankings
Dependent packages count: 2.2%
Dependent repos count: 7.7%
Downloads: 10.2%
Average: 11.3%
Forks count: 17.0%
Stargazers count: 19.4%
Maintainers (1)
Last synced: 6 months ago

Dependencies

requirements.txt pypi
  • atomicwrites ==1.4.1
  • build ==0.9.0
  • fuzzywuzzy ==0.18.0
  • gdal ==3.4.3
  • jupyter ==1.0.0
  • jupyterlab ==3.5.2
  • matplotlib ==3.6.2
  • networkx ==3.0
  • nltk ==3.8.1
  • openpyxl ==3.0.10
  • orjson ==3.8.4
  • pandas ==1.5.2
  • pdfkit ==1.0.0
  • pip-chill ==1.0.1
  • psycopg2 ==2.9.5
  • pyarrow ==10.0.1
  • pyodbc ==4.0.35
  • pypandoc ==1.10
  • pyproj ==3.4.1
  • pytest-cov ==4.0.0
  • python-levenshtein ==0.20.9
  • python-rapidjson ==1.9
  • pyxlsb ==1.0.10
  • scikit-learn ==1.2.0
  • shapely ==2.0.0
  • sphinx-copybutton ==0.5.1
  • sphinx-rtd-theme ==1.1.1
  • sqlalchemy ==1.4.46
  • twine ==4.0.2
  • ujson ==5.7.0
  • xlrd ==2.0.1
  • xlsx2csv ==0.8.0
  • xlsxwriter ==3.0.6
pyproject.toml pypi
setup.py pypi