Andromeda

AsynchroNous Disk-based Representation of MassivE DAta: An R package aimed at replacing ff for storing large data objects.

https://github.com/ohdsi/andromeda

Science Score: 36.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
    1 of 13 committers (7.7%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (14.9%) to scientific vocabulary

Keywords

hades
Last synced: 9 months ago · JSON representation

Repository

AsynchroNous Disk-based Representation of MassivE DAta: An R package aimed at replacing ff for storing large data objects.

Basic Info
Statistics
  • Stars: 11
  • Watchers: 10
  • Forks: 13
  • Open Issues: 18
  • Releases: 4
Topics
hades
Created over 6 years ago · Last pushed 10 months ago
Metadata Files
Readme Changelog

README.md

Andromeda

Build Status codecov.io CRAN_Status_Badge CRAN_Status_Badge

Andromeda is part of HADES.

Introduction

AsynchroNous Disk-based Representation of MassivE DAta (ANDROMEDA): An R package for storing large data objects. Andromeda allow storing data objects on a local drive, while still making it possible to manipulate the data in an efficient manner.

Features

  • Allows storage of data objects much larger than what can fit in memory.
  • Integrates with dplyr package for data manipulation.
  • Objects are stored in a temporary location on the local file system.
  • Ability to save and load the objects to a compressed file in a permanent location on the local file system.

Examples

```r library(Andromeda) bigData <- andromeda()

Add some 'big' data:

bigData$cars <- cars

Manipulate using dplyr:

bigData$cars %>% filter(speed > 10) %>% count() %>% collect()

# A tibble: 1 x 1

n

1 41

saveAndromeda(bigData, "bigData.zip") close(bigData) ```

Technology

The Andromeda package is an R package wrapped around RSQLite.

System Requirements

Running the package requires R.

Installation

To install the latest development version, install from GitHub:

r install.packages("devtools") devtools::install_github("ohdsi/Andromeda")

User Documentation

Documentation can be found on the package website.

Support

Contributing

Read here how you can contribute to this package.

License

Andromeda is licensed under Apache License 2.0

Development

Andromeda is being developed in R Studio.

Development status

Beta. Use at your own risk.

Owner

  • Name: Observational Health Data Sciences and Informatics
  • Login: OHDSI
  • Kind: organization

GitHub Events

Total
  • Create event: 4
  • Release event: 2
  • Issues event: 5
  • Issue comment event: 13
  • Push event: 22
  • Pull request review event: 1
  • Pull request event: 9
  • Fork event: 4
Last Year
  • Create event: 4
  • Release event: 2
  • Issues event: 5
  • Issue comment event: 13
  • Push event: 22
  • Pull request review event: 1
  • Pull request event: 9
  • Fork event: 4

Committers

Last synced: 11 months ago

All Time
  • Total Commits: 149
  • Total Committers: 13
  • Avg Commits per committer: 11.462
  • Development Distribution Score (DDS): 0.664
Past Year
  • Commits: 19
  • Committers: 7
  • Avg Commits per committer: 2.714
  • Development Distribution Score (DDS): 0.474
Top Committers
Name Email Commits
schuemie s****e@o****g 50
Schuemie M****I@i****m 37
Adam Black a****k@o****m 26
Adam Black a****3 15
Marc Suchard m****d@u****u 9
Admin_mschuemi A****i@i****m 3
Adam Black a****k@a****n 3
Hadley Wickham h****m@g****m 1
Egill Axfjord Fridgeirsson e****x@g****m 1
Bill O'Brien w****n@g****m 1
OBrien w****n@v****v 1
OBrien W****n@v****v 1
Adam Black a****k@A****l 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 10 months ago

All Time
  • Total issues: 39
  • Total pull requests: 34
  • Average time to close issues: 5 months
  • Average time to close pull requests: about 1 month
  • Total issue authors: 18
  • Total pull request authors: 7
  • Average comments per issue: 6.21
  • Average comments per pull request: 1.41
  • Merged pull requests: 22
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 2
  • Pull requests: 6
  • Average time to close issues: 2 days
  • Average time to close pull requests: 26 days
  • Issue authors: 1
  • Pull request authors: 3
  • Average comments per issue: 0.5
  • Average comments per pull request: 0.33
  • Merged pull requests: 3
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • schuemie (9)
  • ablack3 (5)
  • egillax (5)
  • lhjohn (2)
  • gowthamrao (2)
  • msuchard (2)
  • jreps (2)
  • Tolderck558 (1)
  • azimov (1)
  • solis9753 (1)
  • jduke99 (1)
  • DrTTrousers (1)
  • hms1 (1)
  • tomseinen (1)
  • mvankessel-EMC (1)
Pull Request Authors
  • ablack3 (21)
  • egillax (5)
  • schuemie (4)
  • hadley (2)
  • msuchard (2)
  • ob325 (2)
  • olivroy (2)
  • mgirlich (1)
Top Labels
Issue Labels
enhancement (2)
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • cran 1,806 last-month
  • Total docker downloads: 42,216
  • Total dependent packages: 5
  • Total dependent repositories: 44
  • Total versions: 17
  • Total maintainers: 1
cran.r-project.org: Andromeda

Asynchronous Disk-Based Representation of Massive Data

  • Versions: 17
  • Dependent Packages: 5
  • Dependent Repositories: 44
  • Downloads: 1,806 Last month
  • Docker Downloads: 42,216
Rankings
Docker downloads count: 0.6%
Dependent repos count: 3.8%
Forks count: 7.9%
Average: 8.6%
Dependent packages count: 10.9%
Downloads: 11.1%
Stargazers count: 17.4%
Maintainers (1)
Last synced: 10 months ago

Dependencies

DESCRIPTION cran
  • dplyr * depends
  • DBI * imports
  • RSQLite * imports
  • cli * imports
  • dbplyr * imports
  • hms * imports
  • methods * imports
  • pillar * imports
  • rlang * imports
  • tidyselect * imports
  • zip * imports
  • knitr * suggests
  • rJava * suggests
  • rcmdcheck * suggests
  • rmarkdown * suggests
  • stringr * suggests
  • testthat * suggests
  • tibble * suggests
.github/workflows/R_CMD_check_Hades.yaml actions
  • actions/cache v3 composite
  • actions/checkout v3 composite
  • actions/download-artifact v3 composite
  • actions/upload-artifact v3 composite
  • r-lib/actions/setup-pandoc v2 composite
  • r-lib/actions/setup-r v2 composite
  • r-lib/actions/setup-tinytex v2 composite
.github/workflows/R_CMD_check_main_weekly.yaml actions
  • actions/checkout v3 composite
  • r-lib/actions/check-r-package v2 composite
  • r-lib/actions/setup-pandoc v2 composite
  • r-lib/actions/setup-r v2 composite
  • r-lib/actions/setup-r-dependencies v2 composite
  • r-lib/actions/setup-tinytex v2 composite
.github/workflows/nightly_cleanup_Hades.yml actions
  • c-hive/gha-remove-artifacts v1 composite