pqc-paper-repo

https://github.com/pws1453/pqc-paper-repo

Science Score: 54.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
✓
Academic publication links
Links to: zenodo.org
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (7.6%) to scientific vocabulary

Last synced: 10 months ago · JSON representation ·

Repository

Basic Info

Host: GitHub
Owner: pws1453
License: gpl-3.0
Language: Python
Default Branch: main
Size: 617 KB

Statistics

Stars: 0
Watchers: 1
Forks: 1
Open Issues: 0
Releases: 8

Created about 4 years ago · Last pushed over 1 year ago

Metadata Files

Readme License Citation

Trend Analyzing Algorithm designed alongside "A Survey of Zero-Knowledge Proofs in a Post-Quantum Context"

An Overview

This script is designed to mine the abstracts in the 'abstracts.csv' file, to find common words or phrases within them. These abstracts will be separated by the year. Keyphrases or keywords will only be recorded if they occur in more than one paper, as to avoid recording words or phrases unique to a single paper.

Respository Architecture

Important Folders

paper-sorted contains the curated csvs that were used to derive certain figures in the paper.

unsorted-sanitized contain the csvs generated by the python script, which will sanitize the abstracts.csv file, in order to perform frequency analysis.

Important Files

abstracts.csv is a CSV containing only the abstract, and the year. It is used with the python script.

article-info.csv is a CSV containing authorship information, and other metadata regarding the articles whose abstracts we mined. All of these abstracts are publicly-availible. We thank the authors of these articles and the publishers for making them so.

blocklist.txt is a newline-delimited file, containing words that should not be accounted for in the frequency analysis

mine-year.py is the main python script. There are no arguments or options.

Usage

python3 mine-year.py

Procedures used

To derive the results located in the 'unsorted-sanitized', run the script.

The CSVs provided in the 'paper-sorted' folder were processed additionally. To create these files, we accounted for plural or alternate forms of words by adding the instances of these forms to the root word.

Owner

Login: pws1453
Kind: user

Repositories: 5
Profile: https://github.com/pws1453

Citation (CITATION.cff)

cff-version: 1.2.0
title: >-
  Trend Analyzing Algorithm designed alongside "A
  Survey of Zero-Knowledge Proofs in a Post-Quantum
  Context"
message: >-
  If you use this software as part of a software
  project, or to guide a research project, please
  cite it as below.
type: software
authors:
  - given-names: Preston
    family-names: Sergent
  - given-names: Bryan
    family-names: Richez
  - given-names: Alexa
    family-names: Krempa
  - given-names: Sam
    family-names: Benoist
  - given-names: Jay
    family-names: Shah
identifiers:
  - type: doi
    value: 10.5281/zenodo.6510301
  - type: url
    value: 'https://github.com/pws1453/pqc-paper-repo'

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science