rasterly

Rapidly generate raster images from large datasets in R with Plotly.js

https://github.com/plotly/rasterly

Science Score: 10.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
    1 of 5 committers (20.0%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (17.3%) to scientific vocabulary

Keywords

data-visualization plotly plotly-dash r rstats
Last synced: 6 months ago · JSON representation

Repository

Rapidly generate raster images from large datasets in R with Plotly.js

Basic Info
  • Host: GitHub
  • Owner: plotly
  • License: other
  • Language: R
  • Default Branch: master
  • Homepage:
  • Size: 16.3 MB
Statistics
  • Stars: 48
  • Watchers: 6
  • Forks: 4
  • Open Issues: 0
  • Releases: 0
Archived
Topics
data-visualization plotly plotly-dash r rstats
Created over 6 years ago · Last pushed over 5 years ago
Metadata Files
Readme License

README.md

rasterly

Build Status Codecov test coverage CRAN status

Easily and rapidly generate raster data in R, even for larger volumes of data, with an aesthetics-based mapping syntax that should be familiar to users of the ggplot2 package.

While rasterly does not attempt to reproduce the full functionality of the Datashader graphics pipeline system for Python, the rasterly API has several core elements in common with that software package. Combined with Plotly.js and the plotly package, rasterly enables analysts to generate interactive figures which are responsive enough to embed into web applications.

Documentation: https://z267xu.github.io/rasterly/

Importing datasets for use with rasterly

There are several ways to import large datasets into R for use with rasterly; one option is the data.table package (https://cran.r-project.org/web/packages/data.table/vignettes/datatable-intro.html).

  • csv file: library(data.table) data <- data.table::fread("yourpath/somefile.csv") # or a link

  • parquet file: Apache Parquet is a column-oriented, open-source format which offers efficient data compression. There are a few options in R for importing Parquet data. One of these is the arrow package, now available on CRAN.

The package must build Apache Arrow first, so it may take a few minutes to install the first time around.

``` library(arrow) parquetdata <- readparquet("somefile.parquet")

returns a data.frame if sparklyr is not loaded, otherwise it will be a tibble

to obtain an ordinary data.frame, some slight postprocessing may be required

parquetdata <- base::as.data.frame(parquetdata)

```

  • fst file: The fst package is an excellent option for extremely fast serialization of large data frames in R. In addition to rapid compression using LZ4 and ZSTD, it provides support for multithreading to parallelize operations.

library(fst) fst_data <- read.fst("somefile.fst")

Installing the package

The rasterly package is now available from CRAN, and the most recent release will always be available on GitHub. To install the CRAN package: install.packages("rasterly") To install the current version available via GitHub instead: remotes::install_github("plotly/rasterly")

Visualizing data with rasterly

rasterly is inspired by the datashader package available for Python. Both provide the capability to generate raster data for rapid rendering of graphics for even very large datasets.

In terms of performance, datashader is faster but rasterly is comparable. rasterly aims to provide a user-friendly interface to generate raster data for use with the plotly package; it cannot be used for plotting or rendering figures on its own.

Producing an interactive graph with the plotly package

To illustrate the basic functionality provided by the package, we'll start by retrieving data on Uber trips taken in New York City from April 1st until September 30th of 2014. The dataset includes 4,533,327 observations.

```

Load New York Uber data

ridesRaw1 <- "https://raw.githubusercontent.com/plotly/datasets/master/uber-rides-data1.csv" %>% data.table::fread(stringsAsFactors = FALSE) ridesRaw2 <- "https://raw.githubusercontent.com/plotly/datasets/master/uber-rides-data2.csv" %>% data.table::fread(stringsAsFactors = FALSE) ridesRaw3 <- "https://raw.githubusercontent.com/plotly/datasets/master/uber-rides-data3.csv" %>% data.table::fread(stringsAsFactors = FALSE) ridesDf <- list(ridesRaw1, ridesRaw2, ridesRaw3) %>% data.table::rbindlist() ```

Now that the data are loaded, we can pass them to plot_ly and pipe the output into add_rasterly:

plot_ly(ridesDf, x = ~Lat, y = ~Lon) %>% add_rasterly_heatmap()

General usage

Pass the data into rasterly: ``` ridesDf %>% rasterly(mapping = aes(x = Lat, y = Lon)) %>% rasterly_points() -> p p

or use simplied rplot

with(ridesDf, rplot(x = Lat, y = Lon) ) ```

Note that, p is a list of environments. The display info can be accessed through r <- rasterly_build(p) str(r)

"r" contains image raster and other useful info (like numeric aggregation matrices) required to produce the image but it does not provide any graphs.

Example use in interactive web applications

The Uber NYC Rasterizer application in our Dash Gallery provides a simple live demo of the rasterly package in action. Check it out here!

Uber NYC Rasterizer screenshot

A second Dash for R application to visualize (a much larger) dataset from the US Census Bureau is also available.

Owner

  • Name: Plotly
  • Login: plotly
  • Kind: organization
  • Location: Montréal

GitHub Events

Total
Last Year

Committers

Last synced: almost 3 years ago

All Time
  • Total Commits: 315
  • Total Committers: 5
  • Avg Commits per committer: 63.0
  • Development Distribution Score (DDS): 0.365
Top Committers
Name Email Commits
z267xu z****u@g****m 200
z267xu 4****u@u****m 51
Ryan Patrick Kyle r****n@p****y 32
Ryan Patrick Kyle r****e@u****m 25
zehao xu z****u@u****a 7
Committer Domains (Top 20 + Academic)

Packages

  • Total packages: 1
  • Total downloads: unknown
  • Total dependent packages: 0
  • Total dependent repositories: 1
  • Total versions: 2
cran.r-project.org: rasterly

Easily and Rapidly Generate Raster Image Data with Support for 'Plotly.js'

  • Versions: 2
  • Dependent Packages: 0
  • Dependent Repositories: 1
Rankings
Stargazers count: 7.7%
Forks count: 10.1%
Average: 18.2%
Dependent repos count: 25.5%
Dependent packages count: 29.8%
Last synced: 7 months ago

Dependencies

DESCRIPTION cran
  • R >= 3.4.0 depends
  • Rcpp * depends
  • methods * depends
  • data.table * imports
  • ggplot2 * imports
  • grid * imports
  • magrittr * imports
  • plotly * imports
  • rlang * imports
  • stats * imports
  • covr * suggests
  • knitr * suggests
  • lubridate * suggests
  • rmarkdown * suggests
  • testthat * suggests