rdf-property-and-datatype-usage-scanner

A tool to survey the usage of properties and datatypes in RDF dataset files (as the Web Data Commons) on the web.

https://github.com/fusion-jena/rdf-property-and-datatype-usage-scanner

Science Score: 52.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
    Organization fusion-jena has institutional domain (fusion.cs.uni-jena.de)
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (5.4%) to scientific vocabulary
Last synced: 10 months ago · JSON representation ·

Repository

A tool to survey the usage of properties and datatypes in RDF dataset files (as the Web Data Commons) on the web.

Basic Info
  • Host: GitHub
  • Owner: fusion-jena
  • License: mit
  • Language: Java
  • Default Branch: main
  • Homepage:
  • Size: 233 KB
Statistics
  • Stars: 0
  • Watchers: 4
  • Forks: 0
  • Open Issues: 0
  • Releases: 4
Created over 4 years ago · Last pushed almost 2 years ago
Metadata Files
Readme License Citation

readme.md

RDF Property and Datatype Usage Scanner

This is a tool to survey the usage of properties and datatypes in RDF dataset files (as the Web Data Commons) on the web.

Usage

How to start the experiment: 1. Preparation: Clone the repository and run mvn clean package to build the tool. 2. Run: * java -jar target/Scanner.jar --category <category_name> --list <list_file_url> <database_folder> to schedule files to scan * ./measure.sh <database_folder> to start the measure in background * java -jar target/Scanner.jar --results <result_file.csv.gz> <database_folder> to export measurements as a gzipped CSV * java -jar target/Scanner.jar --errors <error_file.csv.gz> <database_folder> to export errors as a gzipped CSV

For automatic restart on Linux edit crontab using crontab -e and add the line: @reboot bash <path to script>/measure.sh <database_folder> (keep an empty line at the end of the document)

Owner

  • Name: FUSION
  • Login: fusion-jena
  • Kind: organization
  • Location: Jena, Germany

Heinz Nixdorf Chair for Distributed Information Systems, Friedrich Schiller University Jena

Citation (CITATION.cff)

cff-version: 1.2.0
title: RDF Property and Datatype Usage Scanner
message: 'If you use this software, please cite it as below.'
type: software
authors:
  - given-names: Merle
    family-names: Gänßinger
    orcid: 'https://orcid.org/0000-0003-4481-069X'
  - given-names: Jan Martin
    family-names: Keil
    orcid: 'https://orcid.org/0000-0002-7733-0193'
license: MIT License

GitHub Events

Total
  • Pull request event: 1
Last Year
  • Pull request event: 1

Dependencies

pom.xml maven
  • com.h2database:h2 2.1.210
  • info.picocli:picocli 4.6.3
  • org.apache.commons:commons-lang3 3.0
  • org.apache.jena:apache-jena-libs 4.4.0
  • org.slf4j:slf4j-log4j12 2.0.0-alpha6
  • org.junit.jupiter:junit-jupiter-api 5.5.1 test
  • org.junit.jupiter:junit-jupiter-engine 5.5.1 test