https://github.com/chainsawriot/ecosent

Data & Analysis compendium for the Economic Sentiment analysis paper

https://github.com/chainsawriot/ecosent

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 1 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (6.8%) to scientific vocabulary
Last synced: 10 months ago · JSON representation

Repository

Data & Analysis compendium for the Economic Sentiment analysis paper

Basic Info
  • Host: GitHub
  • Owner: chainsawriot
  • License: mit
  • Default Branch: master
  • Size: 49.1 MB
Statistics
  • Stars: 0
  • Watchers: 0
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Fork of vanatteveldt/ecosent
Created over 1 year ago · Last pushed over 3 years ago

https://github.com/chainsawriot/ecosent/blob/master/

# Economic Sentiment
Data & Analysis compendium for the Economic Sentiment analysis paper.

# Article

View the [article](https://www.tandfonline.com/doi/pdf/10.1080/19312458.2020.1869198?needAccess=true) (open access).

# Code

The main analysis code is located in the [src/data-processing](src/data-processing). 
Of interest might be:

* [10_apply_dictionaries.py](src/data-processing/) Sentiment Dictionaries (Python)
* [11_apply_dictionaries_quanteda.R](src/data-processing/11_apply_dictionaries_quanteda.R) Sentiment Dictionaries (Quanteda)
* [20_svm.py](src/data-processing/20_svm.py) Support Vector Machines
* [22_nb.py](src/data-processing/22_nb.py) Naive Bayes
* [40_cnn_predict.py](src/data-processing/40_cnn_predict.py) Convolutional Neural Network

# Data

The following data files might be of interest:

* [metadata.csv](data/intermediate/metadata.csv) Headline, date, and source of each article
* [gold.csv](data/intermediate/gold.csv) Gold standard (expert) coding
* [crowdcodings.csv](data/intermediate/crowdcodings.csv) Crowd codings 
* [manual_coding.csv](data/intermediate/manual_coding.csv) Manual (student) coding of headlines

See the files in [src/data-processing](src/data-processing) for details on how these files were constructed.

# Results as presented in the article

* [Performance, learning curve, correlations](src/analysis/performance.md)

# Online Appendix / Supplementary Results

* [Crowd coding instructions](report/crowdcoding.md)
* [Confusion Matrices of all methods](src/analysis/confusion_matrix.md)
* [Grid Search for CNN](src/analysis/cnn_gridsearch.md) (code: [30_cnn_gridsearch.py](src/data-processing/30_cnn_gridsearch.py)
* [Grid Search for SVM](src/analysis/svm_gridsearch.md) (code: [19_svm_gridsearch.py](src/data-processing/19_svm_gridsearch.py)
* [Validation of ML results against student codings](src/analysis/ml_versus_students.md)
* [Error analysis](src/analysis/error_analysis.md)

Owner

  • Login: chainsawriot
  • Kind: user
  • Location: Germany
  • Company: @gesistsa

GitHub Events

Total
Last Year