benford_russian_income_surveys

This is the code and data to replicate the analysis in Skougarevskiy (2023)'s "Russian Household Income Surveys: Ensuring Data Quality" paper.

https://github.com/irlcode/benford_russian_income_surveys

Science Score: 57.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 2 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (8.5%) to scientific vocabulary

Keywords

benford-analysis benfords-law household-surveys

Repository


Basic Info
  • Host: GitHub
  • Owner: irlcode
  • License: other
  • Language: R
  • Default Branch: main
  • Homepage:
  • Size: 73.8 MB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 1
  • Open Issues: 0
  • Releases: 0
Created over 2 years ago · Last pushed almost 2 years ago
Metadata Files
Readme License Citation Authors

README.md

Data and Code For Russian Household Income Surveys: Ensuring Data Quality

This is the code and data to replicate the analysis in Skougarevskiy (2023)'s "Russian Household Income Surveys: Ensuring Data Quality" paper.

Installation

To replicate the analysis, clone this repository to your local machine, then install the required versions of the R dependencies listed in DEPENDENCIES. The script code/helper_functions/install_dependencies.r automates this step, but you may still need to install the underlying system libraries manually with Homebrew or apt-get, depending on your platform. Finally, declare the environment variable BENFORD_RUSSIAN_INCOME_SURVEYS_PATH in bash so that it points to the cloned repository. Better yet, add it to your .Renviron:

    user:~$ echo 'BENFORD_RUSSIAN_INCOME_SURVEYS_PATH="path_to_cloned_repository"' >> ~/.Renviron
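Once the variable is declared, an R session can check that it resolves to a real directory before any script runs. The following is a minimal sketch, not code from the repository: the helper names get_repo_path and repo_file are hypothetical, and only the variable name and the survey_data/income_surveys.rdata path come from this README.

```r
# Hypothetical helpers (not part of the repository) that locate the cloned
# repository through the environment variable described above.
get_repo_path <- function(var = "BENFORD_RUSSIAN_INCOME_SURVEYS_PATH") {
  path <- Sys.getenv(var, unset = "")
  if (!nzchar(path)) {
    stop("Environment variable ", var, " is not set; add it to ~/.Renviron")
  }
  if (!dir.exists(path)) {
    stop("Environment variable ", var, " points to a missing directory: ", path)
  }
  normalizePath(path)
}

# Build a path to a file inside the repository
repo_file <- function(...) {
  file.path(get_repo_path(), ...)
}

# Usage (commented out so that sourcing this snippet has no side effects):
# load(repo_file("survey_data", "income_surveys.rdata"))
```

R reads ~/.Renviron at startup, so restart the session after editing that file for the new value to become visible.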

Repository structure

/
├── code/
|   ├── 1_summary_stat_income_surveys.r  # Summary statistics and first-digit distributions
|   ├── 2a_test_conformity_all_surveys.r  # Test all surveys' conformity to Benford's law
|   ├── 2b_within_rlms_conformity.r  # Test RLMS conformity within SSU/new/old respondents
|   ├── 2c_power_simulations.r  # Conduct power simulations of Benford tests
|   ├── 3_produce_plots_and_tables.r  # Produce all Figures and Tables in the paper
|   ├── helper_functions/
|   |   ├── install_dependencies.r  # Install dependencies
|   |   └── helper_functions.r  # Functions performing all statistical tests
|   └── prepare_surveys/  # Code to prepare survey data
|       ├── 1a_convert_rlms.r  # HSE's Russian longitudinal monitoring survey
|       ├── 1b_convert_cb_finances_survey.r  # Central Bank's Consumer finances survey
|       ├── 1c_convert_rosstat_surveys_to_rdata.r  # Rosstat's VNDN survey
|       ├── 1d_convert_finmon_to_rdata.r  # HSE's FINMON survey
|       ├── 1e_convert_rcsv_to_rdata.r  # EUSPb's Russian Crime Victimization survey
|       ├── 1f_convert_sipp_survey.r  # BLS's Survey of Income and Program Participation
|       ├── 1j_convert_eusilc_surveys.r  # European Union Statistics on Income and Living Conditions surveys
|       ├── 2_combine_surveys.r  # Combines all surveys into survey_data/income_surveys.rdata object
|       └── 3_identify_new_observations_per_wave.r  # Identify the year when household/individual appeared in RLMS/CB survey
├── survey_data/
|   ├── income_surveys.rdata  # Data with income and identifiers from all surveys
|   └── income_surveys_first_year.rdata  # Years when household/individual appeared in RLMS/CB survey
├── output/
|   ├── elibrary_papers_mentioning_income_surveys.csv  # Papers citing surveys from eLIBRARY.ru (as of August 23, 2023)
|   ├── income_surveys_digit_distrib.csv  # First digit distribution in all surveys
|   ├── income_surveys_income_conformity_frequentist.rdata  # Frequentist Benford test statistics and p-values per survey
|   ├── income_surveys_income_conformity.rdata  # Bayes factors and posterior probability of Benford conformity per survey
|   ├── income_surveys_summary_stat.csv  # Summary statistics per survey
|   ├── rlms_household_digit_distrib_per_area_id_year.csv  # First digit distribution per RLMS secondary sampling unit
|   ├── rlms_household_income_conformity_by_area_id_year.rdata  # Benford conformity per RLMS secondary sampling unit
|   ├── rlms_household_income_conformity_by_novelty.rdata  # Benford conformity by RLMS household first/non-first year in survey
|   └── vodpf_household_income_conformity_by_novelty.rdata  # Benford conformity by CB survey respondent novelty
├── tables/
|   ├── recent_income_surveys_digit_distrib.csv  # Table 1
|   ├── recent_income_surveys_conformity.csv  # Table 2
|   ├── household_survey_conformity.csv  # Table 3, household-level
|   ├── individual_survey_conformity.csv  # Table 3, individual-level
|   ├── recent_income_surveys_summary_stat.csv  # Table A.1
|   └── rlms_household_conformers_per_area_id_year.csv  # Table A.2
└── figures/
    ├── digit_probs_plot.pdf  # Figure 1
    ├── rlms_conformity_plot.pdf  # Figure 2
    ├── conformity_novelty_plot.pdf  # Figure 3
    ├── yearly_survey_mentions_top_journals_plot.pdf  # Figure A.1
    ├── benford_power_simulations_plot.pdf  # Figure A.2
    └── rlms_ssu_share_plot.pdf  # Figure A.3
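Several of the scripts above compute first-digit distributions and test them against Benford's law, under which a leading digit d (1 to 9) occurs with probability log10(1 + 1/d). The sketch below illustrates that idea on simulated data. It is an assumption-laden illustration and not the implementation in code/helper_functions/helper_functions.r: the function names are hypothetical, the incomes are simulated, and a plain chi-squared goodness-of-fit test stands in for the frequentist and Bayesian tests used in the paper.

```r
# Benford's law: expected probability of each leading digit 1..9
benford_probs <- log10(1 + 1 / (1:9))
names(benford_probs) <- 1:9

# Hypothetical helper: leading significant digit of each positive, finite value.
# Scientific notation puts that digit first, e.g. 1234 -> "1.234...e+03".
first_digit <- function(x) {
  x <- as.numeric(x)
  x <- x[is.finite(x) & x > 0]
  as.integer(substr(sprintf("%.15e", x), 1, 1))
}

# Hypothetical helper: counts of leading digits, keeping empty digits as zeros
digit_counts <- function(x) {
  table(factor(first_digit(x), levels = 1:9))
}

# Simulated incomes (NOT survey data): log-uniform over three orders of
# magnitude, a textbook case in which leading digits follow Benford's law
set.seed(42)
simulated_income <- 10^runif(5000, min = 3, max = 6)

counts <- digit_counts(simulated_income)
observed_share <- as.numeric(counts) / sum(counts)
print(round(rbind(benford = benford_probs, observed = observed_share), 3))

# Chi-squared goodness-of-fit test of the observed counts against Benford's law
benford_test <- chisq.test(x = as.numeric(counts), p = benford_probs)
print(benford_test)
```

To try the same check on real data, replace simulated_income with a numeric income vector. The object and column names inside survey_data/income_surveys.rdata are not documented in this README, so inspect them after loading the file.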

Data availability statement

The data that support the findings of this study were derived from the following resources available in the public domain:
- RLMS conducted by National Research University "Higher School of Economics" and OOO “Demoscope” together with Carolina Population Center, University of North Carolina at Chapel Hill and the Institute of Sociology of the Federal Center of Theoretical and Applied Sociology of the Russian Academy of Sciences (RLMS-HSE web sites: https://rlms-hse.cpc.unc.edu, https://www.hse.ru/org/hse/rlms)
- VNDN conducted by Rosstat and freely available
- VODPF published by the Central Bank of Russia
- FINMON published by the Higher School of Economics
- RCVS published by the European University at St. Petersburg
- SIPP published by the U.S. Bureau of Labor Statistics
- EUSILC public use data by the European Commission

Licence

This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA 4.0).

Copyright © the respective contributors, as shown by the AUTHORS file.

Contacts

Dmitriy Skougarevskiy, Ph.D. dskougarevskiy@eu.spb.ru

Owner

  • Name: Institute for the Rule of Law at European University at St. Petersburg
  • Login: irlcode
  • Kind: organization
  • Email: ipp@eu.spb.ru
  • Location: Saint Petersburg, Russia

Public repository

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this data or code, please cite it as below."
authors:
- family-names: "Skougarevskiy"
  given-names: "Dmitriy"
  orcid: "0000-0002-4022-6210"
title: "Russian Household Income Surveys: Ensuring Data Quality"
date-released: 2023-10-01
year: 2023
url: "https://github.com/irlcode/benford_russian_income_surveys"
preferred-citation:
  type: unpublished
  authors:
  - family-names: "Skougarevskiy"
    given-names: "Dmitriy"
    orcid: "0000-0002-4022-6210"
  url: "https://github.com/irlcode/benford_russian_income_surveys"
  title: "Russian Household Income Surveys: Ensuring Data Quality"
  year: 2023
