qualitymetadataspatiallandmarkdataset
Metadata describing and assessing the quality of spatial landmark datasets in mountain area
https://github.com/anrchoucas/qualitymetadataspatiallandmarkdataset
Science Score: 49.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 2 DOI reference(s) in README -
✓Academic publication links
Links to: zenodo.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (6.8%) to scientific vocabulary
Repository
Metadata describing and assessing the quality of spatial landmark datasets in mountain area
Basic Info
- Host: GitHub
- Owner: ANRChoucas
- License: cc0-1.0
- Language: Java
- Default Branch: main
- Size: 173 KB
Statistics
- Stars: 0
- Watchers: 2
- Forks: 1
- Open Issues: 0
- Releases: 1
Metadata Files
README.md
QualityMetadataSpatialLandmarkDataset
This project contains the procedures to reproduce the results of the paper A method to produce metadata describing and assessing the quality of spatial landmark datasets in mountain area, M.-D. Van Damme, A.-M. Olteanu Raimond
README Contents - Development & Contributions - Procedure n°1 to reproduce table 3 (row Uncertainty) and table 4 * Loading data * Reproduction of the row Uncertainty of table 3 * Reproduction of table 4 - Procedure n°2 to reproduce data matching and then to have the results of a section of table 3 * Loading data - Procedure n°3 to reproduce boxplot of Samal distance for names in Refuges.info source * Loading data * Generate the distribution for the samal distance * Create the boxplot
Development & Contributions
- Institute: LASTIG, Univ Gustave Eiffel, ENSG, IGN
- License: CC0-1.0 license
- Authors:
- Marie-Dominique Van Damme
- Ana-Maria Raimond
Procedure n°1 to reproduce table 3 (row Uncertainty) and table 4
All the steps described below concern the camptocamp.org data source. To get the results of the other data sources (OpenStreeetMap.org, Refuges.info, rando.ecrins-parcnational.fr and rando.parc-du-vercors.fr), it will be necessary to adapt the link of dataset to download and the table names in the SQL scripts.
Loading data for procedure n°1
These instructions will be executed before the first or the second reproducing that follow.
Input data :
- initial datasets:
- data matching links results: matchingresultCamptocampandBDTOPO.csv
- alignement file between the dataset and the OOR ontology: AlignmentCamptocampOOR.csv
Coding environnement: PostGreSQL/POSTGIS
Steps to follow
- Step 1: create a database in PostGreSQL:
sql CREATE DATABASE agile_metadata_2022 - Step 2: install postgis extension for this new database (see Extension menu)
- Step 3: import all the needed data in the postgres database: run the SQL script sql/0loadingdata.sql
- Step 1: create a database in PostGreSQL:
Reproduction of the row Uncertainty of table 3
Run the first request in the script SQL sql/1_confidence.sql to get the DQ_confidence for all the scope.
Note: the two other scripts compute the DQ_confidence for a subset of the types. This is an example for on demand metadata; for example if the user needs to assess only the confidence of the matching algorithm for a specific types of landmarks (e.g. those corresponding the the ontology class "isolated accomodation")
Reproduction of table 4
MeanAdsolute2D, RootMeanSquareError, and AgreementRate Threshold rows
- Run the script SQL sql/2spatialaccuracy.sql
Overall accuracy and Confusion Matrix rows
- Run the script SQL sql/3confusionmatrix_all.sql
- Import the result in a tabular software like Excel or OpenOffice
- Create a cross table: the values of the first column correspond to the line, the values of the second column correspond to the column and the values of the third column correspond to the quantitative values of the cross table
- Several couple of values, not in the diagonal, are correctly classified items:
(lieu-dit, col), (lieu-dit, croix), (lieu-dit, massifboisé), (lieu-dit, rocher), (lieu-dit, surfaceneigeetglace), (lieu-dit, vallée), (lieu-dit, abri), (lieu-dit, hébergementisolé), (hébergementisolé, abri), (hébergementaccessible, gîte), (hébergementisolé, refuge), (gite, refuge), (abri, refuge), ((vide), lac), (dépression_fermée, grotte) these pairs are used also to compute overall accuracy. - The overall accuracy is the sum of items on the main diagonal + items correctly classified) divided by the sum of all items from the matrix
Duplicate row
- Run the script SQL sql/4duplicateall by changing each time the name of the dataset table
Non quantitative attribute accuracy measures
- Run the script SQL sql/5Samaldistance.sql
Missing class "nom"
- Run the script SQL sql/6missingclass.sql
Excess and Missing items rows
You have to create a worksheet in a tabular software like Excel or OpenOffice
Prepare the worksheet by creating these columns:

Run each request in the script SQL sql/7_completeness.sql and put the result column per column
Calculate the sum of elements in each column, for example the sum are stored in line 115.
Then, you have:
- Excess = (B115+C115)/E115
- Missing items = F115/H115
Procedure n°2 to reproduce data matching and then to have the results of a section of table 3
Loading data for procedure n°2
- Input ressources:
- The six dataset files : Five spatial landmark datasets" downloaded on the plateform Zenodo (version 1.0)
- The five files : Alignment between type of landmark in different sources and the concept in the spatial reference objects ontology
- The current Java project “QualityMetadataSpatialLandmarkDataset”. There is not need You don’t need to install MultiCriteriaMatching code. It is a depedency library of the project QualityMetadataSpatialLandmarkDataset (maven project).
Step n°1: installation of the Java project
- Java Install:
- Download and install the Java Development Kit (JDK) (jdk 8) from the Oracle website
- Eclipse
- Download and install the IDE Eclipse
- Download the project QualityMetadataSpatialLandmarkDataset on your local system
- Import the project in Eclipse like a maven project
Step n°2: import dataset files in the project
- drop the six landmark dataset files in the data/dataset folder
- drop the five alignements files in the data/alignment folder
Step n°3, launch data matching script
Launch the Java main file MainMatchingCamptocampBdtopo.java. This program loads data and computes the matching links between the sources of datasets and the BDTOPO dataset.
At the end of the computation, the data are matched and the results are stored as a CSV file (e.g. c2c-bdtopo-XXX.csv”) in the resultat folder. Note : rows one (1:0) and two (1:1) of Table 3 are obtained directly from the Java console print
Step n°4, Validate the data matching results (manually)
- Launch QGIs
- Install the plugIn
- Copy the visuvalideMultiCriteriaMatching plugIn in the QGIS folder
- Open QGIS;
- In the QGIS Extensions look for the visuvalideMultiCriteriaMatching plugIn; a small icon is added on the QGIS ‘s plugIn toolbar
- Execute the plugIn visuvalideMultiCriteriaMatching by click on the button in toolbar
- Import the file creating in the step before.
- Validate landmark by landmark
- The results are store in a shapefile
Procedure n°3 to reproduce boxplot of Samal distance for names in Refuges.info source
Loading data for procedure n°3
Input data :
- data matching links results: matchingresultRefugesInfoandBDTOPO.csv
Coding environnement: PostGreSQL/POSTGIS
Steps to follow
- Step 1: create a database in PostGreSQL:
sql CREATE DATABASE agile_metadata_2022 - Step 2: import all the needed data in the postgres database: run the SQL script sql/p30loading_data.sql
- Step 1: create a database in PostGreSQL:
Generate the distribution for the samal distance
Run the SQL script p31boxplotsamaldistance.sql
Export query result to .csv file, for example distances_samal.csv
Create the boxplot
- This is an example to create a boxplot with R software:
sql x <- read.csv("/home/glagaffe/distsamal.csv",header=T, sep=",") boxplot(x, xlab="Refuges.info", ylab="Samal distance", main="Boxplot of Samal distance for names in Refuges.info source")
Owner
- Name: CHOUCAS
- Login: ANRChoucas
- Kind: organization
- Website: http://choucas.ign.fr/
- Repositories: 18
- Profile: https://github.com/ANRChoucas
CodeMeta (codemeta.json)
{
"@context": "https://doi.org/10.5063/schema/codemeta-2.0",
"type": "SoftwareSourceCode",
"author": [
{
"id": "https://orcid.org/0000-0002-0007-5972",
"type": "Person",
"affiliation": {
"type": "Organization",
"name": "LASTIG, Univ Gustave Eiffel, IGN-ENSG"
},
"email": "marie-dominique.van-damme@ensg.eu",
"familyName": "VAN DAMME",
"givenName": "Marie-Dominique"
},
{
"type": "schema:Role",
"schema:author": "https://orcid.org/0000-0002-0007-5972",
"schema:roleName": "Developer"
},
{
"id": "https://orcid.org/0000-0002-1101-1333",
"type": "Person",
"affiliation": {
"type": "Organization",
"name": "LASTIG, Univ Gustave Eiffel, IGN-ENSG"
},
"email": "ana-maria.raimond@ign.fr",
"familyName": "Olteanu-Raimond",
"givenName": "Ana-Maria"
},
{
"type": "schema:Role",
"schema:author": "https://orcid.org/0000-0002-1101-1333",
"schema:roleName": "Developer"
}
],
"codeRepository": "https://github.com/ANRChoucas/QualityMetadataSpatialLandmarkDataset.git",
"dateCreated": "2022-01-01",
"dateModified": "2022-06-10",
"description": "This project contains the procedures to reproduce the results of the paper: Van Damme, M.-D. and Olteanu-Raimond, A.-M.: A method to produce metadata describing and assessing the quality of spatial landmark datasets in mountain area, AGILE GIScience Ser., 3, 17, https://doi.org/10.5194/agile-giss-3-17-2022, 2022",
"downloadUrl": "https://github.com/ANRChoucas/QualityMetadataSpatialLandmarkDataset/archive/refs/tags/v1.0.0.zip",
"license": "https://spdx.org/licenses/CC0-1.0",
"name": "QualityMetadataSpatialLandmarkDataset",
"operatingSystem": "All",
"programmingLanguage": [
"Java",
"SQL"
],
"schema:releaseNotes": "First version, version for the reproducibility process",
"runtimePlatform": "JVM",
"softwareRequirements": "Postgres",
"version": "1.0.0",
"developmentStatus": "active"
}
GitHub Events
Total
Last Year
Dependencies
- com.google.code.gson:gson 2.2.4
- com.googlecode.json-simple:json-simple 1.1
- com.vividsolutions:jts-io 1.14.0
- de.siegmar:fastcsv 2.0.0
- fr.ign.cogit:MultiCriteriaMatching 1.0-SNAPSHOT
- jakarta.xml.bind:jakarta.xml.bind-api 3.0.0
- org.apache.httpcomponents:httpmime 4.5.6
- org.apache.sis.core:sis-feature 1.1
- org.apache.sis.core:sis-metadata 1.1-M1
- org.apache.sis.core:sis-referencing 1.1-M1
- org.apache.sis.core:sis-utility 1.1-M1
- org.apache.sis.non-free:sis-embedded-data 1.0
- org.apache.sis.profiles:sis-french-profile 1.1
- org.apache.sis.storage:sis-storage 1.1
- org.geotools:gt-opengis 8.4
- org.geotools:gt-referencing 8.4
- org.json:json 20180130
- org.locationtech.jts:jts-core 1.16.1
- junit:junit 4.10 test