RKaggle

An R package for downloading datasets from Kaggle into the R Console

https://github.com/benyamindsmith/rkaggle

Science Score: 39.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 2 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (10.9%) to scientific vocabulary
Last synced: 9 months ago

Repository

An R package for downloading datasets from Kaggle into the R Console

Basic Info
  • Host: GitHub
  • Owner: benyamindsmith
  • License: other
  • Language: R
  • Default Branch: main
  • Size: 212 KB
Statistics
  • Stars: 50
  • Watchers: 1
  • Forks: 2
  • Open Issues: 2
  • Releases: 0
Created about 1 year ago · Last pushed 11 months ago
Metadata Files
Readme License

README.md

RKaggle

An R package for downloading datasets from Kaggle into the R Console.

Installation

To install the latest version from CRAN, run:

```r
install.packages("RKaggle")
```

To install the development version, run:

```r
# devtools provides install_github()
install.packages("devtools")
devtools::install_github("benyamindsmith/RKaggle")
```

Sample Usage

```r
library(RKaggle)

# Download and read the "canadian-prime-ministers" dataset from Kaggle
canadian_prime_ministers <- RKaggle::get_dataset("benjaminsmith/canadian-prime-ministers")
canadian_prime_ministers

# A tibble: 29 × 5
#    No.        Name                Political Party       Term Start       Term End
#  1 1 (1 of 2) John A. Macdonald   Liberal-Conservative  1 July 1867      5 November 1873
#  2 2          Alexander Mackenzie Liberal               7 November 1873  8 October 1878
#  3 1 (2 of 2) John A. Macdonald   Liberal-Conservative  17 October 1878  6 June 1891
#  4 3          John Abbott         Liberal-Conservative  16 June 1891     24 November 1891
#  5 4          John Thompson       Liberal-Conservative  5 December 1892  12 December 1894
#  6 5          Mackenzie Bowell    Conservative          21 December 1894 27 April 1896
#  7 6          Charles Tupper      Conservative          1 May 1896       8 July 1896
#  8 7          Wilfrid Laurier     Liberal               11 July 1896     10/6/1911
#  9 8          Robert Borden       Government (Unionist) 10/10/1911       7/10/1920
# 10 9 (1 of 2) Arthur Meighen      Conservative          7/10/1920        12/29/1921
# ℹ 19 more rows
# ℹ Use print(n = ...) to see more rows
```
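
Because `get_dataset()` has to reach Kaggle over the network, a script may want to continue gracefully when a download does not succeed. The following is a minimal sketch and not part of RKaggle: `safe_get_dataset()` is a hypothetical wrapper, and it assumes that a failed download surfaces as an ordinary R error, which this README does not confirm. The `fetch` argument is also an invention for illustration; it only exists so the wrapper can be tried without a network connection.

```r
# Hypothetical helper (not part of RKaggle): returns NULL instead of
# stopping the script when loading a dataset fails.
safe_get_dataset <- function(slug, fetch = RKaggle::get_dataset) {
  tryCatch(
    fetch(slug),
    error = function(e) {
      # Assumption: failures are raised as regular R errors.
      warning("Could not load '", slug, "': ", conditionMessage(e),
              call. = FALSE)
      NULL
    }
  )
}

# Usage (requires RKaggle and network access):
# pm <- safe_get_dataset("benjaminsmith/canadian-prime-ministers")
# if (!is.null(pm)) print(head(pm))
```

Returning `NULL` keeps the caller in control of what happens next; a stricter script could re-raise the error instead.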

Present Scope

RKaggle currently supports downloading and loading only the following file formats:

  • .csv
  • .tsv
  • .xlsx
  • .json
  • .rds
  • .parquet
  • .ods
  • .shp
  • .geojson
  • .feather
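
To check ahead of time whether a file falls within this scope, the extension list above can be written as a small lookup. This is only a sketch in base R: `supported_formats` and `is_supported_format()` are hypothetical names, and how RKaggle itself detects file types is not described here.

```r
# Extensions taken from the list above (lower case, without the dot).
supported_formats <- c("csv", "tsv", "xlsx", "json", "rds",
                       "parquet", "ods", "shp", "geojson", "feather")

# Hypothetical helper: TRUE when a file name ends in a listed extension.
# The comparison ignores case, so "map.GeoJSON" counts as geojson.
is_supported_format <- function(path) {
  tolower(tools::file_ext(path)) %in% supported_formats
}

is_supported_format(c("ministers.csv", "map.GeoJSON", "notes.txt"))
#> [1]  TRUE  TRUE FALSE
```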

Blogs

  1. RObservations #51: Download Kaggle Datasets into the R Console with {RKaggle}

References

  1. Akhtar et al. (2024). Croissant: A Metadata Format for ML-Ready Datasets. Proceedings of the Eighth Workshop on Data Management for End-To-End Machine Learning, 4, 1–6. https://doi.org/10.1145/3650203.3663326

Similar Works

Acknowledgements

The TidyTuesday project, on which the code for this package was initially based.

Owner

  • Name: Benjamin Smith
  • Login: benyamindsmith
  • Kind: user
  • Location: Toronto ON

GitHub Events

Total
  • Issues event: 5
  • Watch event: 45
  • Public event: 1
  • Push event: 16
  • Fork event: 2
  • Create event: 1
Last Year
  • Issues event: 5
  • Watch event: 45
  • Public event: 1
  • Push event: 16
  • Fork event: 2
  • Create event: 1

Packages

  • Total packages: 1
  • Total downloads:
    • cran 727 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 2
  • Total maintainers: 1
cran.r-project.org: RKaggle

'Kaggle' Dataset Downloader 'API'

  • Versions: 2
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 727 Last month
Rankings
Dependent packages count: 26.9%
Dependent repos count: 33.1%
Average: 48.9%
Downloads: 86.9%
Last synced: 10 months ago

Dependencies

DESCRIPTION cran
  • R >= 4.1.0 depends
  • httr * imports
  • readr * imports
  • withr * imports
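
The requirements listed above can be checked from an R session before installing. The snippet below is a sketch using base R only; the variable names are illustrative, and `install.packages()` normally pulls in missing imports on its own, so this is mainly useful for diagnosing a failed install.

```r
# Minimum R version from the DESCRIPTION listing above.
r_ok <- getRversion() >= "4.1.0"

# Imported packages from the same listing.
imports <- c("httr", "readr", "withr")
installed <- vapply(imports, requireNamespace, logical(1), quietly = TRUE)

if (!r_ok) {
  message("R 4.1.0 or newer is required; found ",
          as.character(getRversion()))
}
if (any(!installed)) {
  message("Missing packages: ", paste(imports[!installed], collapse = ", "))
}
```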