comp-unstructured-data

Scripts to explore the conditions that determine the reliability of models, trends and status by comparing aggregated cubes with structured monitoring schemes

https://github.com/b-cubed-eu/comp-unstructured-data

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (10.6%) to scientific vocabulary

Keywords

data-cubes data-quality r rstats structured-data unstructured-data
Last synced: 6 months ago · JSON representation ·

Repository

Scripts to explore the conditions that determine the reliability of models, trends and status by comparing aggregated cubes with structured monitoring schemes

Basic Info
  • Host: GitHub
  • Owner: b-cubed-eu
  • License: mit
  • Language: R
  • Default Branch: main
  • Homepage:
  • Size: 1.59 MB
Statistics
  • Stars: 2
  • Watchers: 1
  • Forks: 0
  • Open Issues: 1
  • Releases: 0
Topics
data-cubes data-quality r rstats structured-data unstructured-data
Created over 1 year ago · Last pushed 6 months ago
Metadata Files
Readme Contributing License Code of conduct Citation Zenodo

README.md

GitHub repo status GitHub repo size <!-- badges: end -->

Compare unstructured data

Langeraert, WardORCID logo[^aut][^cre][^INBO] Cartuyvels, EmmaORCID logo[^aut][^INBO] Van Daele, ToonORCID logo[^aut][^INBO] Research Institute for Nature and Forest (INBO)[^cph] European Union's Horizon Europe Research and Innovation Programme (ID No 101059592)^fnd

[^cph]: copyright holder

[^cre]: contact person [^INBO]: Research Institute for Nature and Forest (INBO), Herman Teirlinckgebouw, Havenlaan 88 PO Box 73, B-1000 Brussels, Belgium

keywords: structured data; data quality; unstructured data; data cubes; biodiversity informatics

Scripts to explore the conditions that determine the reliability of models, trends and status by comparing aggregated cubes with structured monitoring schemes. <!-- description: end -->

This code is developed in context of T4.5 of the B-Cubed project.

Analyses workflow

To download the latest version of each of the data sets run prepare_abv_data.Rmd and Prepare_data_10km.Rmd. Alternatively, you can download the exact same data we used by following the GBIF links in these same Rmd's. These data sets are saved under data/raw, the Rmd's further clean the data and add geometric properties. The cleaned data is stored in both .csv and .gpkg format under data/interim.

To get the list of ABV birds used to filter the data in both pipelines run the get_abv_species.R after sourcing the functions in taxon_mapping.R.

To run the targets pipelines, run run_pipeline.R in the folder of the pipeline you want to run. Afterwards, you can run the associated Rmd found under reports.

Repo structure

├── source ├ │ ├── pipelines ├ target pipelines https://books.ropensci.org/targets/ │ ├── R ├ R scripts and R markdown files │ └── reports ├ reports based on output from target pipelines ├── data │ ├── raw ├ create this folder and store raw data, see prepare_abv_data.Rmd │ ├── intermediate ├ store intermediate data │ └── processed ├ store processed data ├── checklist.yml ├ options checklist package (https://github.com/inbo/checklist) ├── organisation.yml ├ organisation info (https://inbo.github.io/checklist/articles/organisation.html) ├── inst │ └── en_gb.dic ├ dictionary with words that should not be checked by the checklist package ├── .github │ │ ├── workflows │ │ │ └── checklist_project.yml ├ GitHub repo settings │ ├── CODE_OF_CONDUCT.md │ │ └── CONTRIBUTING.md │ ├── comp-unstructured-data.Rproj ├ R project ├── README.md ├ project description ├── LICENSE.md ├ licence ├── LICENSE │ ├── CITATION.cff ├ citation info ├── .zenodo.json ├ zenodo metadata └── .gitignore ├ files to ignore

Owner

  • Name: B-Cubed
  • Login: b-cubed-eu
  • Kind: organization

Biodiversity Building Blocks for Policy

Citation (CITATION.cff)

cff-version: 1.2.0
message: If you use this software, please cite it using these metadata.
title: "Compare unstructured data"
authors:
- given-names: Ward
  family-names: Langeraert
  affiliation: Research Institute for Nature and Forest (INBO), Herman Teirlinckgebouw,
    Havenlaan 88 PO Box 73, B-1000 Brussels, Belgium
  orcid: 0000-0002-5900-8109
- given-names: Emma
  family-names: Cartuyvels
  affiliation: Research Institute for Nature and Forest (INBO), Herman Teirlinckgebouw,
    Havenlaan 88 PO Box 73, B-1000 Brussels, Belgium
  orcid: 0000-0001-7856-6360
- given-names: Toon
  family-names: Van Daele
  affiliation: Research Institute for Nature and Forest (INBO), Herman Teirlinckgebouw,
    Havenlaan 88 PO Box 73, B-1000 Brussels, Belgium
  orcid: 0000-0002-1362-853X
keywords:
- structured data
- data quality
- unstructured data
- data cubes
- biodiversity informatics
contact:
- given-names: Ward
  family-names: Langeraert
  affiliation: Research Institute for Nature and Forest (INBO), Herman Teirlinckgebouw,
    Havenlaan 88 PO Box 73, B-1000 Brussels, Belgium
  orcid: 0000-0002-5900-8109
doi: ~
license: MIT
repository-code: https://github.com/b-cubed-eu/comp-unstructured-data/
type: software
abstract: "Scripts to explore the conditions that determine the reliability of models,
  trends and status by comparing aggregated cubes with structured monitoring schemes."

GitHub Events

Total
  • Issues event: 1
  • Watch event: 1
  • Delete event: 3
  • Issue comment event: 2
  • Push event: 51
  • Pull request review comment event: 3
  • Pull request review event: 10
  • Pull request event: 7
  • Create event: 5
Last Year
  • Issues event: 1
  • Watch event: 1
  • Delete event: 3
  • Issue comment event: 2
  • Push event: 51
  • Pull request review comment event: 3
  • Pull request review event: 10
  • Pull request event: 7
  • Create event: 5

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 1
  • Total pull requests: 9
  • Average time to close issues: N/A
  • Average time to close pull requests: about 1 month
  • Total issue authors: 1
  • Total pull request authors: 2
  • Average comments per issue: 0.0
  • Average comments per pull request: 0.44
  • Merged pull requests: 9
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 1
  • Pull requests: 8
  • Average time to close issues: N/A
  • Average time to close pull requests: about 1 month
  • Issue authors: 1
  • Pull request authors: 2
  • Average comments per issue: 0.0
  • Average comments per pull request: 0.5
  • Merged pull requests: 8
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • wlangera (1)
Pull Request Authors
  • wlangera (7)
  • EmmaCartuyvels1 (3)
Top Labels
Issue Labels
Pull Request Labels

Dependencies

.github/workflows/check_project.yml actions
  • inbo/actions/check_project main composite