rdf-property-and-datatype-usage-scanner
A tool to survey the usage of properties and datatypes in RDF dataset files on the web, such as those published by Web Data Commons.
https://github.com/fusion-jena/rdf-property-and-datatype-usage-scanner
Science Score: 52.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ✓ CITATION.cff file: Found CITATION.cff file
- ✓ codemeta.json file: Found codemeta.json file
- ✓ .zenodo.json file: Found .zenodo.json file
- ○ DOI references
- ○ Academic publication links
- ○ Academic email domains
- ✓ Institutional organization owner: Organization fusion-jena has institutional domain (fusion.cs.uni-jena.de)
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: Low similarity (5.4%) to scientific vocabulary
Repository
Statistics
- Stars: 0
- Watchers: 4
- Forks: 0
- Open Issues: 0
- Releases: 4
Metadata Files
readme.md
RDF Property and Datatype Usage Scanner
This is a tool to survey the usage of properties and datatypes in RDF dataset files on the web, such as those published by Web Data Commons.
Usage
How to start the experiment:
1. Preparation: Clone the repository and run `mvn clean package` to build the tool.
2. Run:
* `java -jar target/Scanner.jar --category <category_name> --list <list_file_url> <database_folder>` to schedule files for scanning
* `./measure.sh <database_folder>` to start the measurement in the background
* `java -jar target/Scanner.jar --results <result_file.csv.gz> <database_folder>` to export the measurements as a gzipped CSV file
* `java -jar target/Scanner.jar --errors <error_file.csv.gz> <database_folder>` to export the errors as a gzipped CSV file
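The steps above could be chained in a small wrapper script. The following is only a sketch: the category name, list URL, database folder, and output file names are hypothetical placeholders, and the `run` helper and `DRY_RUN` switch are not part of the tool. By default the script only prints the commands; set `DRY_RUN=0` to execute them.

```shell
#!/usr/bin/env bash
# Sketch of the documented workflow; every value below is a hypothetical placeholder.
set -euo pipefail

CATEGORY="example-category"                # hypothetical category name
LIST_URL="https://example.org/files.list"  # hypothetical URL of the file list
DB_DIR="./scanner-db"                      # hypothetical database folder
DRY_RUN="${DRY_RUN:-1}"                    # 1 = only print commands, 0 = run them

# Print a command and, unless in dry-run mode, execute it.
run() {
  printf '+ %s\n' "$*"
  if [ "$DRY_RUN" = "0" ]; then "$@"; fi
}

# Step 1: build the tool (run from the root of the cloned repository).
run mvn clean package

# Step 2: schedule the files, then start the measurement in the background.
run java -jar target/Scanner.jar --category "$CATEGORY" --list "$LIST_URL" "$DB_DIR"
run ./measure.sh "$DB_DIR"

# Export steps: presumably most useful once the background measurement
# has processed the scheduled files.
run java -jar target/Scanner.jar --results results.csv.gz "$DB_DIR"
run java -jar target/Scanner.jar --errors errors.csv.gz "$DB_DIR"
```

The exported files are gzipped, so something like `gzip -dc results.csv.gz | head` can be used to take a quick look at them.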
To restart the measurement automatically after a reboot on Linux, edit the crontab with `crontab -e` and add the line `@reboot bash <path to script>/measure.sh <database_folder>` (keep an empty line at the end of the file).
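As an alternative to editing the crontab by hand, the entry could be added non-interactively. This is a minimal sketch, assuming a hypothetical script location and database folder; the `add_entry` helper is illustrative and not part of the tool. It only previews the resulting crontab, and the actual installation command is left as a comment.

```shell
#!/usr/bin/env bash
# Sketch: build the @reboot entry and merge it into an existing crontab.
SCRIPT_DIR="/opt/scanner"          # hypothetical path to the folder containing measure.sh
DB_DIR="/var/lib/scanner-db"       # hypothetical database folder
ENTRY="@reboot bash ${SCRIPT_DIR}/measure.sh ${DB_DIR}"

# Read an existing crontab on stdin and write it to stdout with ENTRY
# appended exactly once; the output always ends with a newline.
add_entry() {
  grep -vxF "$ENTRY" || true   # drop any previous copy of the entry
  printf '%s\n' "$ENTRY"
}

# Preview the result for a sample crontab without touching the real one.
printf '# existing job\n' | add_entry

# To install it for the current user (assumes a cron implementation
# that accepts "crontab -" to read from standard input):
#   { crontab -l 2>/dev/null || true; } | add_entry | crontab -
```

Running the merge repeatedly leaves a single copy of the entry, so the script can be re-run safely.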
Owner
- Name: FUSION
- Login: fusion-jena
- Kind: organization
- Location: Jena, Germany
- Website: https://fusion.cs.uni-jena.de/
- Twitter: fusionUniJena
- Repositories: 69
- Profile: https://github.com/fusion-jena
Heinz Nixdorf Chair for Distributed Information Systems, Friedrich Schiller University Jena
Citation (CITATION.cff)
cff-version: 1.2.0
title: RDF Property and Datatype Usage Scanner
message: 'If you use this software, please cite it as below.'
type: software
authors:
- given-names: Merle
family-names: Gänßinger
orcid: 'https://orcid.org/0000-0003-4481-069X'
- given-names: Jan Martin
family-names: Keil
orcid: 'https://orcid.org/0000-0002-7733-0193'
license: MIT License
GitHub Events
Total
- Pull request event: 1
Last Year
- Pull request event: 1
Dependencies
- com.h2database:h2 2.1.210
- info.picocli:picocli 4.6.3
- org.apache.commons:commons-lang3 3.0
- org.apache.jena:apache-jena-libs 4.4.0
- org.slf4j:slf4j-log4j12 2.0.0-alpha6
- org.junit.jupiter:junit-jupiter-api 5.5.1 (test scope)
- org.junit.jupiter:junit-jupiter-engine 5.5.1 (test scope)