Science Score: 31.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (6.5%) to scientific vocabulary
Last synced: 10 months ago

Repository

Basic Info
  • Host: GitHub
  • Owner: edbennett
  • License: MIT
  • Language: Python
  • Default Branch: main
  • Size: 487 KB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 1
Created almost 2 years ago · Last pushed almost 2 years ago
Metadata Files
Readme License Citation

README.txt

zipf analysis
=============

This repository contains code to test whether books adhere to Zipf's law,
written in support of the paper "Zipf analysis of 19th-century English-language books",
V. Dracula, to appear in Annals of Computational Linguistics, 2022.

To run the code, you will need the pandas package installed.

To reproduce the figures in the publication, run the command:

    bash bin/run_analysis.sh

This script automatically downloads the full text of the two books to
process (Frankenstein and Dracula) from Project Gutenberg (gutenberg.org) and places
them in the `data/` directory. Internet access is required for this step.

Results will be placed in a `results/` directory.
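
For readers unfamiliar with the idea: Zipf's law says that a word's frequency
is roughly inversely proportional to its rank in the frequency table, so a plot
of log(frequency) against log(rank) should be close to a straight line with
slope -1. The sketch below illustrates that check. It is not the code in this
repository: the function names `word_counts` and `zipf_exponent` are
hypothetical, the tokenisation rule is an assumption, and it uses only the
Python standard library rather than pandas.

```python
import math
import re
from collections import Counter


def word_counts(text):
    """Count case-insensitive words (letters, with internal apostrophes)."""
    return Counter(re.findall(r"[a-z]+(?:'[a-z]+)*", text.lower()))


def zipf_exponent(counts):
    """Negated least-squares slope of log(frequency) against log(rank).

    Zipf's law predicts a value close to 1. At least two distinct words
    are needed, otherwise the slope is undefined.
    """
    freqs = sorted(counts.values(), reverse=True)
    if len(freqs) < 2:
        raise ValueError("need at least two distinct words")
    xs = [math.log(rank) for rank in range(1, len(freqs) + 1)]
    ys = [math.log(freq) for freq in freqs]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return -sxy / sxx


if __name__ == "__main__":
    # Hypothetical toy input; a real run would read a full book instead.
    sample = "the cat and the dog and the bird"
    counts = word_counts(sample)
    print(counts.most_common(2))
    print(zipf_exponent(counts))
```

On a text as short as the toy sample the fitted exponent is not meaningful;
the law is a statement about large corpora such as whole books.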

Owner

  • Name: Ed Bennett
  • Login: edbennett
  • Kind: user
  • Location: Swansea, UK
  • Company: Swansea Academy of Advanced Computing, Swansea University

Citation (CITATION)

If you use this code in your own work or research presented in a publication, we ask
that you please cite:

"Zipf analysis of 19th-century English-language books",
V. Dracula, to appear in Annals of Computational Linguistics, 2022

Dependencies

environment.yml conda
  • blas 1.0.*
  • brotli 1.0.9.*
  • brotli-bin 1.0.9.*
  • bzip2 1.0.8.*
  • c-ares 1.19.1.*
  • ca-certificates 2024.7.2.*
  • curl 8.7.1.*
  • cycler 0.11.0.*
  • freetype 2.12.1.*
  • intel-openmp 2023.1.0.*
  • jpeg 9e.*
  • krb5 1.20.1.*
  • lcms2 2.12.*
  • lerc 3.0.*
  • libbrotlicommon 1.0.9.*
  • libbrotlidec 1.0.9.*
  • libbrotlienc 1.0.9.*
  • libcurl 8.7.1.*
  • libcxx 14.0.6.*
  • libdeflate 1.17.*
  • libedit 3.1.20230828.*
  • libev 4.33.*
  • libffi 3.4.4.*
  • libnghttp2 1.57.0.*
  • libpng 1.6.39.*
  • libssh2 1.11.0.*
  • libtiff 4.5.1.*
  • libwebp-base 1.3.2.*
  • lz4-c 1.9.4.*
  • matplotlib-base 3.8.4.*
  • mkl 2023.1.0.*
  • mkl_fft 1.3.8.*
  • mkl_random 1.2.4.*
  • ncurses 6.4.*
  • numpy-base 1.26.4.*
  • openjpeg 2.4.0.*
  • openssl 3.0.14.*
  • python 3.11.9.*
  • python-tzdata 2023.3.*
  • readline 8.2.*
  • six 1.16.0.*
  • sqlite 3.45.3.*
  • tbb 2021.8.0.*
  • tk 8.6.14.*
  • tzdata 2024a.*
  • xz 5.4.6.*
  • zlib 1.2.13.*
  • zstd 1.5.5.*