daltoolboxdp

DAL ToolBox Data Preprocessing

https://github.com/cefet-rj-dal/daltoolboxdp

Science Score: 39.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 2 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (18.0%) to scientific vocabulary
Last synced: 10 months ago

Repository

Basic Info
  • Host: GitHub
  • Owner: cefet-rj-dal
  • License: other
  • Language: R
  • Default Branch: main
  • Homepage:
  • Size: 2.92 MB
Statistics
  • Stars: 3
  • Watchers: 3
  • Forks: 1
  • Open Issues: 0
  • Releases: 0
Created about 2 years ago · Last pushed 12 months ago
Metadata Files
  • Readme
  • License

README.Rmd

---
output: github_document
---



```{r, include = FALSE}
knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>",
  fig.path = "man/figures/README-",
  out.width = "100%"
)
```

# Python-Based Extensions for Data Analytics Workflows


![GitHub Stars](https://img.shields.io/github/stars/cefet-rj-dal/daltoolboxdp?logo=Github)
![CRAN Downloads](https://cranlogs.r-pkg.org/badges/daltoolboxdp)


The `daltoolboxdp` package provides Python-based extensions for data analytics workflows, particularly for data preprocessing and predictive modeling. It includes tools for:

- Data sampling and transformation  
- Feature selection  
- Balancing strategies (e.g., SMOTE)  
- Model construction and tuning

These capabilities call Python libraries through the `reticulate` interface, which gives R code access to the broader Python machine learning ecosystem. The package also supports instance selection and hybrid workflows that combine R and Python functionality in flexible, reproducible analytical pipelines.
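Because the Python side is reached through `reticulate`, it can help to confirm that a Python installation and the expected modules are visible from R before running a pipeline. The following is a minimal sketch, not part of the package: `check_python_modules` is a hypothetical helper, and the module names `numpy` and `sklearn` are assumptions about what a given workflow might need.

```r
# Hypothetical helper: report which Python modules reticulate can see.
# The default module names are illustrative assumptions, not a list
# taken from the package documentation.
check_python_modules <- function(modules = c("numpy", "sklearn")) {
  missing_all <- stats::setNames(rep(FALSE, length(modules)), modules)

  # reticulate itself may not be installed on this machine.
  if (!requireNamespace("reticulate", quietly = TRUE)) {
    message("reticulate is not installed; try install.packages(\"reticulate\")")
    return(missing_all)
  }

  # Try to initialize Python; treat any failure as "not available".
  python_ok <- tryCatch(
    reticulate::py_available(initialize = TRUE),
    error = function(e) FALSE
  )
  if (!python_ok) {
    message("reticulate could not find a usable Python installation")
    return(missing_all)
  }

  # Named logical vector: TRUE where the module can be imported.
  vapply(modules, reticulate::py_module_available, logical(1))
}

status <- check_python_modules()
print(status)
```

A `FALSE` entry suggests that the corresponding module would need to be installed in the Python environment that `reticulate` is using.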

The architecture is inspired by the **Experiment Lines** approach, which promotes modularity, extensibility, and interoperability across tools.  
More information on Experiment Lines is available in [Ogasawara et al. (2009)](https://doi.org/10.1007/978-3-642-02279-1_20).

---

# Examples

Example scripts are available at:

- [Examples](https://github.com/cefet-rj-dal/daltoolboxdp/tree/main/examples)
- [Autoencoders](https://github.com/cefet-rj-dal/daltoolboxdp/tree/main/autoencoder)
- [Time Series](https://github.com/cefet-rj-dal/daltoolboxdp/tree/main/timeseries)

---

# Installation

You can install the latest stable version from CRAN:

```r
install.packages("daltoolboxdp")
```

To install the development version from GitHub:

```r
# install.packages("devtools")  # run once if devtools is not yet installed
devtools::install_github("cefet-rj-dal/daltoolboxdp", force = TRUE, dependencies = FALSE, upgrade = "never")
```
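After either install route, a quick check can confirm which version ended up in the library. This is a small sketch using base R only; `installed_version` is a hypothetical helper, not a function from the package.

```r
# Hypothetical helper: return the installed version of a package as a
# string, or NA if the package cannot be loaded.
installed_version <- function(pkg) {
  if (requireNamespace(pkg, quietly = TRUE)) {
    as.character(utils::packageVersion(pkg))
  } else {
    NA_character_
  }
}

installed_version("daltoolboxdp")
```

An `NA` result means the package is not installed in any library on the current search path.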

---

# Bug reports and feature requests

Please report issues or suggest new features via:

- [GitHub Issues](https://github.com/cefet-rj-dal/daltoolboxdp/issues)

Owner

  • Name: Data Analytics Lab
  • Login: cefet-rj-dal
  • Kind: organization

GitHub Events

Total
  • Watch event: 2
  • Member event: 1
  • Push event: 74
  • Gollum event: 2
  • Pull request event: 6
  • Fork event: 2
  • Create event: 2
Last Year
  • Watch event: 2
  • Member event: 1
  • Push event: 74
  • Gollum event: 2
  • Pull request event: 6
  • Fork event: 2
  • Create event: 2

Issues and Pull Requests

Last synced: 10 months ago

All Time
  • Total issues: 0
  • Total pull requests: 3
  • Average time to close issues: N/A
  • Average time to close pull requests: 23 days
  • Total issue authors: 0
  • Total pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 3
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 3
  • Average time to close issues: N/A
  • Average time to close pull requests: 23 days
  • Issue authors: 0
  • Pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 3
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
  • Diego07101985 (3)
Top Labels
Issue Labels
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • cran 21,268 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 7
  • Total maintainers: 1
cran.r-project.org: daltoolboxdp

Python-Based Extensions for Data Analytics Workflows

  • Versions: 7
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 21,268 Last month
Rankings
Dependent packages count: 28.0%
Dependent repos count: 36.0%
Average: 49.6%
Downloads: 84.8%
Maintainers (1)
Last synced: 11 months ago

Dependencies

DESCRIPTION cran
  • FSelector * imports
  • daltoolbox * imports
  • doBy * imports
  • glmnet * imports
  • leaps * imports
  • smotefamily * imports