arealdb
Harmonise and integrate heterogeneous areal data https://doi.org/10.1016/j.envsoft.2020.104799
Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
✓DOI references
Found 2 DOI reference(s) in README -
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (19.5%) to scientific vocabulary
Keywords
areal-data
database
Last synced: 10 months ago
·
JSON representation
Repository
Harmonise and integrate heterogeneous areal data https://doi.org/10.1016/j.envsoft.2020.104799
Basic Info
- Host: GitHub
- Owner: luckinet
- License: gpl-3.0
- Language: R
- Default Branch: master
- Homepage: https://luckinet.github.io/arealDB/
- Size: 6.85 MB
Statistics
- Stars: 2
- Watchers: 2
- Forks: 0
- Open Issues: 0
- Releases: 5
Topics
areal-data
database
Created over 6 years ago
· Last pushed over 1 year ago
Metadata Files
Readme
Changelog
License
README.Rmd
---
output: github_document
---
```{r, include = FALSE}
knitr::opts_chunk$set(
collapse = TRUE,
comment = "#>",
fig.path = "man/figures/README-",
out.width = "100%"
)
```
# arealDB
[](https://cran.r-project.org/package=arealDB)
[](https://github.com/luckinet/arealDB/actions)
[](https://lifecycle.r-lib.org/articles/stages.html#stable)
[](https://cran.r-project.org/package=arealDB)
## Overview
Areal data are a rather frequent type of data in many applications of the environmental and socio-economic sciences, where various aspects are summarized for particular areas such as administrative territories. Many of those applications surpass the spatial, temporal or thematic scope of any single data source, so that data must be harmonised and normalised across many distinct standards.
`arealDB` has been developed for the purpose of building a standardised database encompassing all issues that come with this. In the current, revised version, it makes use of the `ontologics` R-package to harmonise the names of territories (from geometries) and the target variables (from tables). Moreover, it uses the `tabshiftr` R-package to reshape disorganised tabular data into a common format.

## Installation
1) Install the official version from CRAN:
```{r, eval=FALSE}
install.packages("arealDB")
```
or the latest development version from github:
```{r, eval=FALSE}
devtools::install_github("luckinet/arealDB")
```
2) Read the [paper](https://doi.org/10.1016/j.envsoft.2020.104799) for a more scientific background, or study the vignette on [the arealDB pipeline](https://luckinet.github.io/arealDB/articles/arealDB.html).
## Getting started
To study how `arealDB` works, one can make use of the function `makeExampleDB()`, where the full process of building an areal database can be "simulated" with dummy data. This can be used to train yourself on a particular step based on a fully valid database up until a certain stage of the process. For instance, to set up database that has merely just been started, but doesn't contain any thematic data or geometries yet, one would use `makeExampleDB(path = paste0(tempdir(), "/newDB"), until = "start_arealDB")`.
In principle, `arealDB` follows a simple process involving three stages:
1. **Setup the database (*stage 1*):** To start a new areal database, one needs to specify a gazetteer that contains the valid names of territories and optionally an ontology containing harmonised labels for the concepts in the thematic data.
2. **Register data series, geometries and tables (*stage 2*):** A data item that shall be inserted into a database is registered by calling a register function, which records the configuration (to reorganise it internally into a common standard) of the file and meta-data. Just like the thematic data, which are typically in a table, the spatial data (geometries) and the data series are registered in that way.
3. **Normalize geometries and tables (*stage 3*):** After registering all relevant data, they are reshaped into a standardized database format. In this process terms of territories and target variables are "translated" according to gazetteer and ontology, spatial data are standardized and validated, thematic data are standardized and matched to spatial data, and the spatial data are matched with the optionally already existing spatial database, for instance if that has been built off the GADM (recommended) or GAUL or other standardized datasets.
## Acknowledgement
This work was supported by funding to Carsten Meyer through the Flexpool mechanism of the German Centre for Integrative Biodiversity Research (iDiv) (FZT-118, DFG).
Owner
- Name: LUCKINet
- Login: luckinet
- Kind: organization
- Website: https://www.idiv.de/en/luckinet.html
- Repositories: 3
- Profile: https://github.com/luckinet
Welcome to the "land-use change knowledge integration" networks' software repository
GitHub Events
Total
- Release event: 1
- Push event: 41
- Create event: 1
Last Year
- Release event: 1
- Push event: 41
- Create event: 1
Packages
- Total packages: 1
-
Total downloads:
- cran 252 last-month
- Total dependent packages: 0
- Total dependent repositories: 0
- Total versions: 5
- Total maintainers: 1
cran.r-project.org: arealDB
Harmonise and Integrate Heterogeneous Areal Data
- Homepage: https://github.com/luckinet/arealDB
- Documentation: http://cran.r-project.org/web/packages/arealDB/arealDB.pdf
- License: GPL-3
-
Latest release: 0.9.4
published over 1 year ago
Rankings
Stargazers count: 28.5%
Forks count: 28.6%
Dependent packages count: 29.1%
Dependent repos count: 34.8%
Average: 37.8%
Downloads: 68.0%
Maintainers (1)
Last synced:
10 months ago
Dependencies
.github/workflows/check-standard.yaml
actions
- actions/checkout v3 composite
- r-lib/actions/check-r-package v2 composite
- r-lib/actions/setup-pandoc v2 composite
- r-lib/actions/setup-r v2 composite
- r-lib/actions/setup-r-dependencies v2 composite
.github/workflows/pkgdown.yaml
actions
- JamesIves/github-pages-deploy-action v4.4.1 composite
- actions/checkout v3 composite
- r-lib/actions/setup-pandoc v2 composite
- r-lib/actions/setup-r v2 composite
- r-lib/actions/setup-r-dependencies v2 composite
.github/workflows/test-coverage.yaml
actions
- actions/checkout v3 composite
- actions/upload-artifact v3 composite
- r-lib/actions/setup-r v2 composite
- r-lib/actions/setup-r-dependencies v2 composite
DESCRIPTION
cran
- R >= 2.10 depends
- checkmate * imports
- dplyr * imports
- magrittr * imports
- ontologics * imports
- purrr * imports
- readr * imports
- rlang * imports
- rmapshaper * imports
- sf * imports
- stringr * imports
- tabshiftr * imports
- tibble * imports
- tidyr * imports
- tidyselect * imports
- covr * suggests
- knitr * suggests
- rmarkdown * suggests
- testthat * suggests