https://github.com/buchananja/dpyp
A convenience tool for small-scale data pipelines in Python
Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Committers with academic emails
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (9.3%) to scientific vocabulary
Keywords
data
data-analysis
data-cleaning
data-engineering
data-pipeline
data-preprocessing
data-processing
data-science
pandas
pipeline
Last synced: 6 months ago
·
JSON representation
Repository
A convenience tool for small-scale data pipelines in Python
Basic Info
- Host: GitHub
- Owner: buchananja
- License: mit
- Language: Python
- Default Branch: main
- Homepage: https://pypi.org/project/dpyp/
- Size: 4.28 MB
Statistics
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 2
- Releases: 1
Topics
data
data-analysis
data-cleaning
data-engineering
data-pipeline
data-preprocessing
data-processing
data-science
pandas
pipeline
Created about 2 years ago
· Last pushed over 1 year ago
Metadata Files
Readme
Contributing
License
README.md
dpyp
A convenience tool for small-scale data pipelines in Python
About
dpyp is a data-pipeline convenience tool containing functionality for reading and writing batches, cleaning data, diagnosing pipelines, manipulating text, and calculating fields in Python.
Usage
- dpyp consists of seven modules: 'calculate', 'clean', 'diagnose', 'read', 'text', 'write', and 'transform'.
- Designed for use in small-scale Python pipelines with an emphasis on batch-processing via 'data-dictionaries'.
- Batch processing of data via dictionaries allows iterative functions to improve readability and ease of use.
- Built using a combination of base Python and pandas for writing robust small-scale pipelines with text manipulation capabilities.
Dependencies
- pandas
- pyarrow
- numpy
Installation
bash
pip install dpyp
License
See LICENSE.md
Contributing
See CONTRIBUTING.md
Owner
- Name: James Buchanan
- Login: buchananja
- Kind: user
- Repositories: 1
- Profile: https://github.com/buchananja
GitHub Events
Total
Last Year
Committers
Last synced: 8 months ago
Top Committers
| Name | Commits | |
|---|---|---|
| buchananja | b****b@p****e | 247 |
| buchananja | b****a@p****e | 2 |
Issues and Pull Requests
Last synced: 7 months ago
All Time
- Total issues: 7
- Total pull requests: 0
- Average time to close issues: 7 days
- Average time to close pull requests: N/A
- Total issue authors: 1
- Total pull request authors: 0
- Average comments per issue: 0.29
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- buchananja (1)
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels
Packages
- Total packages: 1
-
Total downloads:
- pypi 13 last-month
- Total dependent packages: 0
- Total dependent repositories: 0
- Total versions: 1
- Total maintainers: 1
pypi.org: dpyp
A pandas convenience wrapper for small-scale data pipelines
- Homepage: https://github.com/buchananja/dpyp
- Documentation: https://dpyp.readthedocs.io/
- License: MIT
-
Latest release: 1.0.0
published almost 2 years ago
Rankings
Dependent packages count: 9.5%
Average: 36.1%
Dependent repos count: 62.7%
Maintainers (1)
Last synced:
6 months ago
Dependencies
dpypr.egg-info/requires.txt
pypi
- pandas *
- pyarrow *
setup.py
pypi
- pandas *
- pyarrow *