cleandata

R Package "cleandata"

https://github.com/sherrisherry/cleandata

Science Score: 10.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
    1 of 3 committers (33.3%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (14.2%) to scientific vocabulary

Keywords

cran data-analysis data-mining machine-learning r r-package wrangling
Last synced: 6 months ago

Repository

Basic Info
Statistics
  • Stars: 3
  • Watchers: 0
  • Forks: 1
  • Open Issues: 0
  • Releases: 0
Created over 7 years ago · Last pushed about 7 years ago
Metadata Files
Readme License

README.md

R Package "cleandata"

A collection of functions that work with data frames to inspect and manipulate data, and to keep track of data manipulation by producing log files.

Available on CRAN: https://cran.r-project.org/package=cleandata

Demonstration: Wrangling Ames Housing Dataset

I plan to keep writing new demos and linking them in this README file.

New in V0.3.0

  • The 'log' parameter can now take its value from a 'log_arg' variable in the parent environment (dynamic scoping) of a function
    • The old way of assigning a value to 'log' is still supported
    • 'log' is the parameter that controls whether log files are produced
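
To make the lookup described above concrete, here is a minimal base-R sketch of the idea. It does not call cleandata: `log_demo` and `pipeline` are hypothetical names, and treating 'log' as a list with `file` and `append` elements is an assumption made for illustration, so check the package help for the values the real functions accept.

```r
# Hypothetical helper (not part of cleandata): resolves 'log' from the
# caller's 'log_arg' when it is not passed explicitly, then appends one
# line to the log file if logging is enabled.
log_demo <- function(msg, log = NULL) {
  caller <- parent.frame()  # environment this function was called from
  if (is.null(log)) {
    # fall back to a 'log_arg' variable visible from the caller
    log <- get0("log_arg", envir = caller, ifnotfound = FALSE)
  }
  if (is.list(log)) {
    cat(msg, "\n", sep = "", file = log$file, append = isTRUE(log$append))
  }
  invisible(is.list(log))
}

log_file <- tempfile(fileext = ".log")

# old way: pass 'log' explicitly on each call
log_demo("explicit argument", log = list(file = log_file, append = TRUE))

# new way: define 'log_arg' once in the calling function
pipeline <- function() {
  log_arg <- list(file = log_file, append = TRUE)
  log_demo("picked up from log_arg")
}
pipeline()

readLines(log_file)
```

The appeal of the second form is that a wrapper function can switch logging on for every step it calls without repeating the argument.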

List of Functions

  • Inspection

    • inspect_map: Classify The Columns of A Data Frame
    • inspect_na: Find Out Which Columns Have Most NAs
    • inspect_smap: A Simplified Thus Faster Version of inspect_map
  • Imputation

    • impute_mean: Impute Missing Values by Mean
    • impute_median: Impute Missing Values by Median
    • impute_mode: Impute Missing Values by Mode
  • Encoding

    • encode_binary: Encode Binary Data Into 0 and 1
    • encode_ordinal: Encode Ordinal Data Into Integers
    • encode_onehot: One-Hot Encoding
  • Partitioning

    • partition_random: Partition A Dataset Randomly
  • Other

    • wh_dict: Create Data Dictionary from Data Warehouse
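
For orientation, the sketch below shows in base R what a few of the operations named above mean on a toy data frame: counting NAs per column, mean and mode imputation, and binary encoding. It deliberately does not call the cleandata functions, whose arguments and return values are documented in the package help. The data frame and its column names are made up for this example.

```r
# toy data, invented for illustration
df <- data.frame(
  age    = c(21, NA, 35, 40),
  smoker = c("yes", "no", NA, "no"),
  stringsAsFactors = FALSE
)

# inspection: number of NAs per column, most incomplete first
na_counts <- sort(colSums(is.na(df)), decreasing = TRUE)

# imputation by mean for a numeric column
df$age[is.na(df$age)] <- mean(df$age, na.rm = TRUE)

# imputation by mode for a categorical column
# (table() ignores NA by default)
mode_value <- names(which.max(table(df$smoker)))
df$smoker[is.na(df$smoker)] <- mode_value

# binary encoding into 0 and 1
df$smoker <- as.integer(df$smoker == "yes")

df
```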

Installation

You can install from CRAN:

```r
install.packages('cleandata')
```

Alternatively, you can download the source package from the release page of this GitHub repo.

```r
# place the source package in your working directory
install.packages('cleandata_0.3.0.tar.gz', repos = NULL, type = "source")
```

Usage

```r
# loading the package
library('cleandata')

# getting help
help(package = 'cleandata')

# getting demos
browseVignettes('cleandata')
```

List of Top-level Contents

  • DESCRIPTION: general information about "cleandata"
  • NAMESPACE: specifies what functions are accessible by users
  • LICENSE: license file of "cleandata"
  • NEWS.md: news of updates
  • cran-comments.md: a record of correspondence with CRAN
  • README.md: this file
  • R/: source code of "cleandata"
  • man/: source code of the documentation of "cleandata"
  • vignettes/: source code of the vignettes of "cleandata"

Update History

  • 07/18/2018: version 0.1.0 was submitted to CRAN
  • 09/03/2018: version 0.2.0 was uploaded to GitHub
  • 09/13/2018: version 0.2.0 was submitted to CRAN
  • 11/30/2018: version 0.3.0 was submitted to CRAN

Committers

Last synced: almost 3 years ago

All Time
  • Total Commits: 4
  • Total Committers: 3
  • Avg Commits per committer: 1.333
  • Development Distribution Score (DDS): 0.5
Top Committers
Name Email Commits
sherrisherry 3****y@u****m 2
Sherry Zhao s****o@g****u 1
amrrs a****s@o****m 1
Committer Domains (Top 20 + Academic)
gwu.edu: 1

Issues and Pull Requests

Last synced: 7 months ago

All Time
  • Total issues: 0
  • Total pull requests: 1
  • Average time to close issues: N/A
  • Average time to close pull requests: about 6 hours
  • Total issue authors: 0
  • Total pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 2.0
  • Merged pull requests: 1
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
  • amrrs (1)

Packages

  • Total packages: 1
  • Total downloads:
    • cran 197 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 3
  • Total maintainers: 1
cran.r-project.org: cleandata

To Inspect and Manipulate Data; and to Keep Track of This Process

  • Versions: 3
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 197 Last month
Rankings
Forks count: 21.9%
Stargazers count: 26.2%
Dependent packages count: 29.8%
Average: 31.0%
Dependent repos count: 35.5%
Downloads: 41.7%
Maintainers (1)
Last synced: 6 months ago

Dependencies

DESCRIPTION cran
  • R >= 3.0.0 depends
  • stats * imports
  • R.rsp * suggests