pqc-paper-repo
Science Score: 54.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ✓ CITATION.cff file: found
- ✓ codemeta.json file: found
- ✓ .zenodo.json file: found
- ○ DOI references
- ✓ Academic publication links: links to zenodo.org
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: low similarity (7.6%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: pws1453
- License: gpl-3.0
- Language: Python
- Default Branch: main
- Size: 617 KB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 1
- Open Issues: 0
- Releases: 8
Metadata Files
README.md
Trend Analyzing Algorithm designed alongside "A Survey of Zero-Knowledge Proofs in a Post-Quantum Context"
An Overview
This script mines the abstracts in the 'abstracts.csv' file to find words and phrases that recur across them. The abstracts are grouped by year. A keyword or keyphrase is recorded only if it occurs in more than one paper, so that terms unique to a single paper are excluded.
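The filtering rule described above can be sketched as follows. This is a minimal illustration, not the code in mine-year.py: the function name `shared_terms_by_year`, the tokenizing regular expression, the choice of one- and two-word phrases, and the decision to count papers rather than total occurrences are all assumptions made for the example.

```python
import re
from collections import Counter, defaultdict


def shared_terms_by_year(rows, max_words=2):
    """Count, per year, how many papers contain each word or phrase.

    rows is an iterable of (year, abstract) pairs. Only terms found in
    more than one paper of the same year are kept.
    """
    doc_freq = defaultdict(Counter)
    for year, abstract in rows:
        # Lowercase tokens; hyphenated words such as "zero-knowledge" stay whole.
        words = re.findall(r"[a-z0-9]+(?:-[a-z0-9]+)*", abstract.lower())
        # A set ensures each term counts at most once per paper.
        terms = set()
        for n in range(1, max_words + 1):
            for i in range(len(words) - n + 1):
                terms.add(" ".join(words[i:i + n]))
        doc_freq[year].update(terms)
    # Drop terms that appear in only one paper of that year.
    return {
        year: Counter({t: c for t, c in counts.items() if c > 1})
        for year, counts in doc_freq.items()
    }


# Hypothetical abstracts, invented for illustration only.
rows = [
    (2020, "Lattice-based zero-knowledge proofs"),
    (2020, "Efficient zero-knowledge proofs from codes"),
    (2021, "Lattice signatures"),
]
result = shared_terms_by_year(rows)
```

With these three sample rows, the 2020 group keeps "zero-knowledge", "proofs", and "zero-knowledge proofs" (each in two papers), while 2021 keeps nothing because it has a single paper.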
Repository Architecture
Important Folders
paper-sorted contains the curated CSVs that were used to derive certain figures in the paper.
unsorted-sanitized contains the CSVs generated by the Python script, which sanitizes the abstracts.csv file in order to perform frequency analysis.
Important Files
abstracts.csv is a CSV containing only each abstract and its year. It is the input to the Python script.
article-info.csv is a CSV containing authorship information and other metadata for the articles whose abstracts we mined. All of these abstracts are publicly available. We thank the authors of these articles and the publishers for making them so.
blocklist.txt is a newline-delimited file containing words that should be excluded from the frequency analysis.
mine-year.py is the main Python script. It takes no arguments or options.
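As a rough sketch of how these input files could be read, the example below loads a newline-delimited blocklist and a two-column CSV. The column names `year` and `abstract`, the helper names, and the case-insensitive matching are assumptions for illustration; the actual layout of abstracts.csv and the logic in mine-year.py may differ. In-memory strings stand in for the real files so the example runs on its own.

```python
import csv
import io


def load_blocklist(lines):
    """Return the set of non-empty, lowercased lines."""
    return {line.strip().lower() for line in lines if line.strip()}


def load_abstracts(handle, year_col="year", abstract_col="abstract"):
    """Read (year, abstract) pairs from a CSV with a header row.

    The column names are hypothetical defaults, not taken from the repository.
    """
    reader = csv.DictReader(handle)
    return [(int(row[year_col]), row[abstract_col]) for row in reader]


def drop_blocked(words, blocklist):
    """Remove words that appear in the blocklist, ignoring case."""
    return [w for w in words if w.lower() not in blocklist]


# In-memory stand-ins for blocklist.txt and abstracts.csv.
blocklist = load_blocklist(io.StringIO("the\nof\n\nwe\n"))
abstracts = load_abstracts(
    io.StringIO('year,abstract\n2021,"We survey proofs, briefly"\n')
)
kept = drop_blocked("We survey the proofs".split(), blocklist)
```

When reading real files, passing `open(path, newline="", encoding="utf-8")` to `load_abstracts` follows the `csv` module's recommendation for handling quoted fields that contain line breaks.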
Usage
python3 mine-year.py
Procedures used
To derive the results in the 'unsorted-sanitized' folder, run the script.
The CSVs in the 'paper-sorted' folder received additional processing. To create them, we accounted for plural and alternate forms of words by adding the counts of these forms to the count of the root word.
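The merging step for plural and alternate forms can be illustrated with a small sketch. The source does not say whether this step was done by hand or by script, so the function `merge_variants`, the explicit variant-to-root mapping, and the sample counts below are all hypothetical.

```python
from collections import Counter


def merge_variants(counts, variant_to_root):
    """Add the count of each variant form to the count of its root word.

    Terms with no entry in variant_to_root are kept unchanged.
    """
    merged = Counter()
    for term, count in counts.items():
        merged[variant_to_root.get(term, term)] += count
    return merged


# Invented counts and mapping, for illustration only.
counts = Counter({"proof": 4, "proofs": 3, "lattice": 2, "lattices": 1, "hash": 5})
variants = {"proofs": "proof", "lattices": "lattice"}
merged = merge_variants(counts, variants)
```

An explicit mapping keeps each merge decision visible and reviewable, which suits a curated dataset better than an automatic stemmer that might conflate unrelated terms.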
Owner
- Login: pws1453
- Kind: user
- Repositories: 5
- Profile: https://github.com/pws1453
Citation (CITATION.cff)
cff-version: 1.2.0
title: >-
  Trend Analyzing Algorithm designed alongside "A
  Survey of Zero-Knowledge Proofs in a Post-Quantum
  Context"
message: >-
  If you use this software as part of a software
  project, or to guide a research project, please
  cite it as below.
type: software
authors:
  - given-names: Preston
    family-names: Sergent
  - given-names: Bryan
    family-names: Richez
  - given-names: Alexa
    family-names: Krempa
  - given-names: Sam
    family-names: Benoist
  - given-names: Jay
    family-names: Shah
identifiers:
  - type: doi
    value: 10.5281/zenodo.6510301
  - type: url
    value: 'https://github.com/pws1453/pqc-paper-repo'
GitHub Events
Total
- Push event: 1
- Create event: 1
Last Year
- Push event: 1
- Create event: 1