source-data

Repo for ingesting source data into the InTaVia Knowledge Graph (IKG)

https://github.com/intavia/source-data

Science Score: 67.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (7.8%) to scientific vocabulary
Last synced: 10 months ago · JSON representation ·

Repository

Repo for ingesting source data into the InTaVia Knowledge Graph (IKG)

Basic Info
  • Host: GitHub
  • Owner: InTaVia
  • Language: Python
  • Default Branch: main
  • Size: 17 MB
Statistics
  • Stars: 0
  • Watchers: 8
  • Forks: 0
  • Open Issues: 16
  • Releases: 2
Created about 3 years ago · Last pushed over 1 year ago
Metadata Files
Readme Citation

README.md

DOI

Source Data repo for InTaVia Knowledge Graph

This repo is used for ingesting named graphs into the InTaVia Knowledge Graph (IKG). The idea of the repository is to have a versioned source of truth that performs data validation (via SHACL) and creates versioning/provenance triples using GitHub actions. Pushes to the triplestore are only done from main. Following main branch it is therefore easily possible to recreate any previous state of the InTaVia triplestore. This fact is also used by the versioning system of the InTaVia Knowledge Graph as new versions of a named graph in the IKG does not replace the old. Instead a new named graph is uploaded and provenance information is added to the provenance graph.

How to contribute

To ingest your data: - prepare your data according to IDM-RDF and the Shacl shapes as a turtle file - add the prepared ttl to the datasets folder - if you are adding your data the first time, add the metadata of yor graph to the datasets.yml file - create a PR with these changes against main

After you created the PR a validation against the shacl constraints is performed. If your data is valid it will be merged to main and pushed to the IKG.

For the moment please commit only one turtle file per PR

Owner

  • Name: InTaVia
  • Login: InTaVia
  • Kind: organization

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: Schlögl
    given-names: Matthias
    orcid: https://orcid.org/0000-0003-1451-0987
  - family-names: Touminen
    given-names: Jouni
    orcid: https://orcid.org/0000-0003-4789-5676
  - family-names: de Boer
    given-names: Victor
    orcid: https://orcid.org/0000-0001-9079-039X
title: "InTaVia source data repository"
version: 0.1-beta
identifiers:
  - type: doi
    value: 10.5281/zenodo.10290205
date-released: 2023-12-7

GitHub Events

Total
  • Push event: 4
  • Pull request event: 4
  • Create event: 1
Last Year
  • Push event: 4
  • Pull request event: 4
  • Create event: 1

Dependencies

.github/workflows/create_prov_graph_and_push.yml actions
  • Gr1N/setup-poetry v8 composite
  • actions/checkout v4 composite
  • actions/setup-python v4 composite
  • tj-actions/changed-files v35 composite
.github/workflows/validate_and_push.yml actions
  • actions/checkout v3 composite
  • konradhoeffner/shacl master composite
  • tj-actions/changed-files v35 composite
.devcontainer/Dockerfile docker
  • mcr.microsoft.com/devcontainers/base jammy build
poetry.lock pypi
  • autopep8 2.0.2 develop
  • black 23.3.0 develop
  • mypy-extensions 1.0.0 develop
  • pathspec 0.11.1 develop
  • platformdirs 3.5.0 develop
  • pycodestyle 2.10.0 develop
  • certifi 2022.12.7
  • charset-normalizer 3.1.0
  • click 8.1.3
  • colorama 0.4.6
  • html5lib 1.1
  • idna 3.4
  • isodate 0.6.1
  • owlrl 6.0.2
  • packaging 23.1
  • prettytable 2.5.0
  • pyparsing 3.0.9
  • pyshacl 0.22.2
  • pyyaml 6.0
  • rdflib 6.3.2
  • requests 2.29.0
  • six 1.16.0
  • sparqlwrapper 2.0.0
  • urllib3 1.26.15
  • wcwidth 0.2.6
  • webencodings 0.5.1
pyproject.toml pypi
  • SPARQLWrapper ^2.0.0
  • click ^8.1.3
  • pyshacl ^0.22.2
  • python ^3.11
  • pyyaml ^6.0
  • rdflib ^6.3.2
  • requests ^2.29.0