qualitymetadataspatiallandmarkdataset

Metadata describing and assessing the quality of spatial landmark datasets in mountain area

https://github.com/anrchoucas/qualitymetadataspatiallandmarkdataset

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 2 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (6.8%) to scientific vocabulary
Last synced: 6 months ago · JSON representation

Repository

Metadata describing and assessing the quality of spatial landmark datasets in mountain area

Basic Info
  • Host: GitHub
  • Owner: ANRChoucas
  • License: cc0-1.0
  • Language: Java
  • Default Branch: main
  • Size: 173 KB
Statistics
  • Stars: 0
  • Watchers: 2
  • Forks: 1
  • Open Issues: 0
  • Releases: 1
Created almost 4 years ago · Last pushed over 1 year ago
Metadata Files
Readme License Authors Codemeta

README.md

QualityMetadataSpatialLandmarkDataset

This project contains the procedures to reproduce the results of the paper A method to produce metadata describing and assessing the quality of spatial landmark datasets in mountain area, M.-D. Van Damme, A.-M. Olteanu Raimond

SWH SWH

README Contents - Development & Contributions - Procedure n°1 to reproduce table 3 (row Uncertainty) and table 4 * Loading data * Reproduction of the row Uncertainty of table 3 * Reproduction of table 4 - Procedure n°2 to reproduce data matching and then to have the results of a section of table 3 * Loading data - Procedure n°3 to reproduce boxplot of Samal distance for names in Refuges.info source * Loading data * Generate the distribution for the samal distance * Create the boxplot

Development & Contributions

  • Institute: LASTIG, Univ Gustave Eiffel, ENSG, IGN
  • License: CC0-1.0 license
  • Authors:
    • Marie-Dominique Van Damme
    • Ana-Maria Raimond


Procedure n°1 to reproduce table 3 (row Uncertainty) and table 4

All the steps described below concern the camptocamp.org data source. To get the results of the other data sources (OpenStreeetMap.org, Refuges.info, rando.ecrins-parcnational.fr and rando.parc-du-vercors.fr), it will be necessary to adapt the link of dataset to download and the table names in the SQL scripts.

Loading data for procedure n°1

These instructions will be executed before the first or the second reproducing that follow.

  • Input data :

  • Coding environnement: PostGreSQL/POSTGIS

  • Steps to follow

    • Step 1: create a database in PostGreSQL: sql CREATE DATABASE agile_metadata_2022
    • Step 2: install postgis extension for this new database (see Extension menu)
    • Step 3: import all the needed data in the postgres database: run the SQL script sql/0loadingdata.sql


Reproduction of the row Uncertainty of table 3

  • Run the first request in the script SQL sql/1_confidence.sql to get the DQ_confidence for all the scope.

  • Note: the two other scripts compute the DQ_confidence for a subset of the types. This is an example for on demand metadata; for example if the user needs to assess only the confidence of the matching algorithm for a specific types of landmarks (e.g. those corresponding the the ontology class "isolated accomodation")


Reproduction of table 4

MeanAdsolute2D, RootMeanSquareError, and AgreementRate Threshold rows

  • Run the script SQL sql/2spatialaccuracy.sql

Overall accuracy and Confusion Matrix rows

  • Run the script SQL sql/3confusionmatrix_all.sql
  • Import the result in a tabular software like Excel or OpenOffice
  • Create a cross table: the values of the first column correspond to the line, the values of the second column correspond to the column and the values of the third column correspond to the quantitative values of the cross table
  • Several couple of values, not in the diagonal, are correctly classified items:
    (lieu-dit, col), (lieu-dit, croix), (lieu-dit, massifboisé), (lieu-dit, rocher), (lieu-dit, surfaceneigeetglace), (lieu-dit, vallée), (lieu-dit, abri), (lieu-dit, hébergementisolé), (hébergementisolé, abri), (hébergementaccessible, gîte), (hébergementisolé, refuge), (gite, refuge), (abri, refuge), ((vide), lac), (dépression_fermée, grotte) these pairs are used also to compute overall accuracy.
  • The overall accuracy is the sum of items on the main diagonal + items correctly classified) divided by the sum of all items from the matrix

Duplicate row

  • Run the script SQL sql/4duplicateall by changing each time the name of the dataset table

Non quantitative attribute accuracy measures

  • Run the script SQL sql/5Samaldistance.sql

Missing class "nom"

  • Run the script SQL sql/6missingclass.sql

Excess and Missing items rows

  • You have to create a worksheet in a tabular software like Excel or OpenOffice

  • Prepare the worksheet by creating these columns:

GitHub Logo

  • Run each request in the script SQL sql/7_completeness.sql and put the result column per column

  • Calculate the sum of elements in each column, for example the sum are stored in line 115.

  • Then, you have:

    • Excess = (B115+C115)/E115
    • Missing items = F115/H115


Procedure n°2 to reproduce data matching and then to have the results of a section of table 3

Loading data for procedure n°2

  1. Input ressources:
  1. The current Java project “QualityMetadataSpatialLandmarkDataset”. There is not need You don’t need to install MultiCriteriaMatching code. It is a depedency library of the project QualityMetadataSpatialLandmarkDataset (maven project).

Step n°1: installation of the Java project

  • Java Install:
    • Download and install the Java Development Kit (JDK) (jdk 8) from the Oracle website
  • Eclipse
    • Download and install the IDE Eclipse
  • Download the project QualityMetadataSpatialLandmarkDataset on your local system
  • Import the project in Eclipse like a maven project

Step n°2: import dataset files in the project

  • drop the six landmark dataset files in the data/dataset folder
  • drop the five alignements files in the data/alignment folder

Step n°3, launch data matching script

Launch the Java main file MainMatchingCamptocampBdtopo.java. This program loads data and computes the matching links between the sources of datasets and the BDTOPO dataset.

At the end of the computation, the data are matched and the results are stored as a CSV file (e.g. c2c-bdtopo-XXX.csv”) in the resultat folder. Note : rows one (1:0) and two (1:1) of Table 3 are obtained directly from the Java console print

Step n°4, Validate the data matching results (manually)

  1. Launch QGIs
  2. Install the plugIn
    • Copy the visuvalideMultiCriteriaMatching plugIn in the QGIS folder
    • Open QGIS;
    • In the QGIS Extensions look for the visuvalideMultiCriteriaMatching plugIn; a small icon is added on the QGIS ‘s plugIn toolbar
  3. Execute the plugIn visuvalideMultiCriteriaMatching by click on the button in toolbar
  4. Import the file creating in the step before.
  5. Validate landmark by landmark
  6. The results are store in a shapefile

Procedure n°3 to reproduce boxplot of Samal distance for names in Refuges.info source

Loading data for procedure n°3

  • Input data :

  • Coding environnement: PostGreSQL/POSTGIS

  • Steps to follow

    • Step 1: create a database in PostGreSQL: sql CREATE DATABASE agile_metadata_2022
    • Step 2: import all the needed data in the postgres database: run the SQL script sql/p30loading_data.sql


Generate the distribution for the samal distance

  • Run the SQL script p31boxplotsamaldistance.sql

  • Export query result to .csv file, for example distances_samal.csv


Create the boxplot

  • This is an example to create a boxplot with R software: sql x <- read.csv("/home/glagaffe/distsamal.csv",header=T, sep=",") boxplot(x, xlab="Refuges.info", ylab="Samal distance", main="Boxplot of Samal distance for names in Refuges.info source")

Owner

  • Name: CHOUCAS
  • Login: ANRChoucas
  • Kind: organization

CodeMeta (codemeta.json)

{
  "@context": "https://doi.org/10.5063/schema/codemeta-2.0",
  "type": "SoftwareSourceCode",
  "author": [
    {
      "id": "https://orcid.org/0000-0002-0007-5972",
      "type": "Person",
      "affiliation": {
        "type": "Organization",
        "name": "LASTIG, Univ Gustave Eiffel, IGN-ENSG"
      },
      "email": "marie-dominique.van-damme@ensg.eu",
      "familyName": "VAN DAMME",
      "givenName": "Marie-Dominique"
    },
    {
      "type": "schema:Role",
      "schema:author": "https://orcid.org/0000-0002-0007-5972",
      "schema:roleName": "Developer"
    },
    {
      "id": "https://orcid.org/0000-0002-1101-1333",
      "type": "Person",
      "affiliation": {
        "type": "Organization",
        "name": "LASTIG, Univ Gustave Eiffel, IGN-ENSG"
      },
      "email": "ana-maria.raimond@ign.fr",
      "familyName": "Olteanu-Raimond",
      "givenName": "Ana-Maria"
    },
    {
      "type": "schema:Role",
      "schema:author": "https://orcid.org/0000-0002-1101-1333",
      "schema:roleName": "Developer"
    }
  ],
  "codeRepository": "https://github.com/ANRChoucas/QualityMetadataSpatialLandmarkDataset.git",
  "dateCreated": "2022-01-01",
  "dateModified": "2022-06-10",
  "description": "This project contains the procedures to reproduce the results of the paper: Van Damme, M.-D. and Olteanu-Raimond, A.-M.: A method to produce metadata describing and assessing the quality of spatial landmark datasets in mountain area, AGILE GIScience Ser., 3, 17, https://doi.org/10.5194/agile-giss-3-17-2022, 2022",
  "downloadUrl": "https://github.com/ANRChoucas/QualityMetadataSpatialLandmarkDataset/archive/refs/tags/v1.0.0.zip",
  "license": "https://spdx.org/licenses/CC0-1.0",
  "name": "QualityMetadataSpatialLandmarkDataset",
  "operatingSystem": "All",
  "programmingLanguage": [
    "Java",
    "SQL"
  ],
  "schema:releaseNotes": "First version, version for the reproducibility process",
  "runtimePlatform": "JVM",
  "softwareRequirements": "Postgres",
  "version": "1.0.0",
  "developmentStatus": "active"
}

GitHub Events

Total
Last Year

Dependencies

pom.xml maven
  • com.google.code.gson:gson 2.2.4
  • com.googlecode.json-simple:json-simple 1.1
  • com.vividsolutions:jts-io 1.14.0
  • de.siegmar:fastcsv 2.0.0
  • fr.ign.cogit:MultiCriteriaMatching 1.0-SNAPSHOT
  • jakarta.xml.bind:jakarta.xml.bind-api 3.0.0
  • org.apache.httpcomponents:httpmime 4.5.6
  • org.apache.sis.core:sis-feature 1.1
  • org.apache.sis.core:sis-metadata 1.1-M1
  • org.apache.sis.core:sis-referencing 1.1-M1
  • org.apache.sis.core:sis-utility 1.1-M1
  • org.apache.sis.non-free:sis-embedded-data 1.0
  • org.apache.sis.profiles:sis-french-profile 1.1
  • org.apache.sis.storage:sis-storage 1.1
  • org.geotools:gt-opengis 8.4
  • org.geotools:gt-referencing 8.4
  • org.json:json 20180130
  • org.locationtech.jts:jts-core 1.16.1
  • junit:junit 4.10 test