toflit18_data

Datapackage for TOFLIT18 research project

https://github.com/medialab/toflit18_data

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 10 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.3%) to scientific vocabulary
Last synced: 6 months ago · JSON representation

Repository

Datapackage for TOFLIT18 research project

Basic Info
  • Host: GitHub
  • Owner: medialab
  • Language: Stata
  • Default Branch: master
  • Homepage:
  • Size: 11.8 GB
Statistics
  • Stars: 5
  • Watchers: 22
  • Forks: 1
  • Open Issues: 19
  • Releases: 4
Created about 11 years ago · Last pushed 7 months ago
Metadata Files
Readme Zenodo

README.md

DOI

README.md

December 21st 2022 version

Welcome to the data repository of the Toflit18 project! More details on the project can be found on its research blog: https://toflit18.hypotheses.org/ Details on the funding, partners, contributors, etc. can be found here: http://toflit18.medialab.sciences-po.fr/#/about The main research tool we provide to researchers is the datascape: http://toflit18.medialab.sciences-po.fr/#/home There is a companion GitHub repository dealing with the software aspects of the datascape (https://github.com/medialab/toflit18).

All the data are encoded in UTF-8, comma-separated. We recommend working with them using LibreOffice.

The data are released under datapackage format.

The data are released under an ODbl licence : http://opendatacommons.org/licenses/odbl/1.0/

Sources

The folder "source" includes sources used in the project as csv files. For more details on the different types of sources, see http://toflit18.medialab.sciences-po.fr/#/exploration/sources

Raw aggregated data

  • The folder "base" has all the data, classification and various files. These include:
    • bdd_centrale.csv.zip which is a aggregation of all sources (except the "Out" ones"). This is the go-to file if you want the latest, raw version of the data.
    • documentation about all variables in bdd_centrale.csv is available in the file "Variables explanation.csv"

Relational database

  • We provide you basically with all the necessary files to do a relational database. We work with it ourselves with the scripts included in the folder "scripts". Our workflow creates Stata and Neo4J output.
  • documentation about the classifications is available in the file "classificationsindex.csv". Do not hesitate to contribute new ones, starting from "marchandisespournouvelleclassification.csv"
  • bddcourante.csv.zip is the "flat file" build around bddcentrale.csv. This includes all the classifications, some computations for imputed value of flow and valueperunit, best guess sources etc. This is the go-to file if you want the lastet, cleaned and enriched version of the data

Contact us

For history related issues, ask Loc Charles (lcharles02@univ-paris8.fr) or Guillaume Daudin (guillaume.daudin@dauphine.psl.eu)

For basic guidance in using these ressources, ask Guillaume Daudin (guillaume.daudin@dauphine.psl.eu)

For advanced technical issues, ask Paul Girard (paul@ouestware.com)

How to cite the dataset

This dataset is published on Zenodo as DOI

You can cite it thusly:

Guillaume Daudin, Loc Charles, Pierre Gervais, Paul Girard, & Guillaume Plique. (2022). TOFLIT18 dataset (1.0.0-zenodo) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.6573397

Owner

  • Name: médialab Sciences Po
  • Login: medialab
  • Kind: organization
  • Location: Paris, France

SciencesPo's médialab is an interdisciplinary research laboratory gathering engineers, designers & social science researchers.

GitHub Events

Total
  • Issues event: 1
  • Watch event: 1
  • Member event: 1
  • Push event: 43
Last Year
  • Issues event: 1
  • Watch event: 1
  • Member event: 1
  • Push event: 43

Issues and Pull Requests

Last synced: 7 months ago

All Time
  • Total issues: 37
  • Total pull requests: 3
  • Average time to close issues: 8 months
  • Average time to close pull requests: less than a minute
  • Total issue authors: 5
  • Total pull request authors: 1
  • Average comments per issue: 1.7
  • Average comments per pull request: 0.0
  • Merged pull requests: 3
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 1
  • Average time to close issues: N/A
  • Average time to close pull requests: less than a minute
  • Issue authors: 0
  • Pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 1
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • gdaudin (14)
  • Yomguithereal (6)
  • paulgirard (4)
  • robindemourat (1)
  • roll (1)
Pull Request Authors
  • Yomguithereal (3)
Top Labels
Issue Labels
bug (6) question (3) reflexion (2) enhancement (2)
Pull Request Labels

Dependencies

Dockerfile docker
  • neo4j 3.5.16 build