source-data
Repo for ingesting source data into the InTaVia Knowledge Graph (IKG)
Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ✓ CITATION.cff file: found CITATION.cff file
- ✓ codemeta.json file: found codemeta.json file
- ✓ .zenodo.json file: found .zenodo.json file
- ✓ DOI references: found 3 DOI reference(s) in README
- ✓ Academic publication links: links to zenodo.org
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: low similarity (7.8%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: InTaVia
- Language: Python
- Default Branch: main
- Size: 17 MB
Statistics
- Stars: 0
- Watchers: 8
- Forks: 0
- Open Issues: 16
- Releases: 2
Metadata Files
README.md
Source Data repo for InTaVia Knowledge Graph
This repo is used for ingesting named graphs into the InTaVia Knowledge Graph (IKG). The idea is to have a versioned source of truth that validates data (via SHACL) and creates versioning/provenance triples using GitHub Actions. Pushes to the triplestore are made only from main, so any previous state of the InTaVia triplestore can be recreated by following the history of the main branch. The versioning system of the IKG relies on this: a new version of a named graph does not replace the old one. Instead, a new named graph is uploaded and provenance information is added to the provenance graph.
How to contribute
To ingest your data:
- Prepare your data as a Turtle file that conforms to IDM-RDF and the SHACL shapes.
- Add the prepared .ttl file to the datasets folder.
- If you are adding your data for the first time, add the metadata of your graph to the datasets.yml file.
- Create a PR with these changes against main.
After you create the PR, the data is validated against the SHACL constraints. If your data is valid, the PR is merged to main and the data is pushed to the IKG.
For the moment, please commit only one Turtle file per PR.
Owner
- Name: InTaVia
- Login: InTaVia
- Kind: organization
- Website: intavia.eu
- Twitter: projectintavia
- Repositories: 10
- Profile: https://github.com/InTaVia
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: Schlögl
    given-names: Matthias
    orcid: https://orcid.org/0000-0003-1451-0987
  - family-names: Touminen
    given-names: Jouni
    orcid: https://orcid.org/0000-0003-4789-5676
  - family-names: de Boer
    given-names: Victor
    orcid: https://orcid.org/0000-0001-9079-039X
title: "InTaVia source data repository"
version: 0.1-beta
identifiers:
  - type: doi
    value: 10.5281/zenodo.10290205
date-released: 2023-12-7
GitHub Events
Total
- Push event: 4
- Pull request event: 4
- Create event: 1
Last Year
- Push event: 4
- Pull request event: 4
- Create event: 1
Dependencies
- Gr1N/setup-poetry v8 composite
- actions/checkout v4 composite
- actions/setup-python v4 composite
- tj-actions/changed-files v35 composite
- actions/checkout v3 composite
- konradhoeffner/shacl master composite
- tj-actions/changed-files v35 composite
- mcr.microsoft.com/devcontainers/base jammy build
- autopep8 2.0.2 develop
- black 23.3.0 develop
- mypy-extensions 1.0.0 develop
- pathspec 0.11.1 develop
- platformdirs 3.5.0 develop
- pycodestyle 2.10.0 develop
- certifi 2022.12.7
- charset-normalizer 3.1.0
- click 8.1.3
- colorama 0.4.6
- html5lib 1.1
- idna 3.4
- isodate 0.6.1
- owlrl 6.0.2
- packaging 23.1
- prettytable 2.5.0
- pyparsing 3.0.9
- pyshacl 0.22.2
- pyyaml 6.0
- rdflib 6.3.2
- requests 2.29.0
- six 1.16.0
- sparqlwrapper 2.0.0
- urllib3 1.26.15
- wcwidth 0.2.6
- webencodings 0.5.1
- SPARQLWrapper ^2.0.0
- click ^8.1.3
- pyshacl ^0.22.2
- python ^3.11
- pyyaml ^6.0
- rdflib ^6.3.2
- requests ^2.29.0