Science Score: 31.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (6.5%) to scientific vocabulary
Last synced: 10 months ago
·
JSON representation
·
Repository
Basic Info
- Host: GitHub
- Owner: edbennett
- License: mit
- Language: Python
- Default Branch: main
- Size: 487 KB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 1
Created almost 2 years ago
· Last pushed almost 2 years ago
Metadata Files
Readme
License
Citation
README.txt
zipf analysis
=============
This repository contains code to observe whether books adhere to Zipf's law, as
done in support of the paper "Zipf analysis of 19th-century English-language books",
V. Dracula, to appear in Annals of Computational Linguistics, 2022.
To run the code, you will need the Pandas package installed.
To reproduce the figures in the publication, run the command:
bash bin/run_analysis.sh
This script will automatically pull the full text of the two books to
process (Frankenstein and Dracula) from Project Gutenberg (gutenberg.org) and place
them into the `data` directory. Internet access is required for this to work.
Results will be placed in a `results/` directory.
Owner
- Name: Ed Bennett
- Login: edbennett
- Kind: user
- Location: Swansea, UK
- Company: Swansea Academy of Advanced Computing, Swansea University
- Repositories: 15
- Profile: https://github.com/edbennett
Citation (CITATION)
If you use this code in your own work or research presented in a publication, we ask that you please cite: "Zipf analysis of 19th-century English-language books", V. Dracula, to appear in Annals of Computational Linguistics, 2022
GitHub Events
Total
Last Year
Dependencies
environment.yml
conda
- blas 1.0.*
- brotli 1.0.9.*
- brotli-bin 1.0.9.*
- bzip2 1.0.8.*
- c-ares 1.19.1.*
- ca-certificates 2024.7.2.*
- curl 8.7.1.*
- cycler 0.11.0.*
- freetype 2.12.1.*
- intel-openmp 2023.1.0.*
- jpeg 9e.*
- krb5 1.20.1.*
- lcms2 2.12.*
- lerc 3.0.*
- libbrotlicommon 1.0.9.*
- libbrotlidec 1.0.9.*
- libbrotlienc 1.0.9.*
- libcurl 8.7.1.*
- libcxx 14.0.6.*
- libdeflate 1.17.*
- libedit 3.1.20230828.*
- libev 4.33.*
- libffi 3.4.4.*
- libnghttp2 1.57.0.*
- libpng 1.6.39.*
- libssh2 1.11.0.*
- libtiff 4.5.1.*
- libwebp-base 1.3.2.*
- lz4-c 1.9.4.*
- matplotlib-base 3.8.4.*
- mkl 2023.1.0.*
- mkl_fft 1.3.8.*
- mkl_random 1.2.4.*
- ncurses 6.4.*
- numpy-base 1.26.4.*
- openjpeg 2.4.0.*
- openssl 3.0.14.*
- python 3.11.9.*
- python-tzdata 2023.3.*
- readline 8.2.*
- six 1.16.0.*
- sqlite 3.45.3.*
- tbb 2021.8.0.*
- tk 8.6.14.*
- tzdata 2024a.*
- xz 5.4.6.*
- zlib 1.2.13.*
- zstd 1.5.5.*