Science Score: 54.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: zenodo.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (7.6%) to scientific vocabulary

Repository

Basic Info
  • Host: GitHub
  • Owner: pws1453
  • License: gpl-3.0
  • Language: Python
  • Default Branch: main
  • Size: 617 KB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 1
  • Open Issues: 0
  • Releases: 8
Created about 4 years ago · Last pushed over 1 year ago
Metadata Files
Readme License Citation

README.md

Trend Analyzing Algorithm designed alongside "A Survey of Zero-Knowledge Proofs in a Post-Quantum Context"

An Overview

This script mines the abstracts in the 'abstracts.csv' file to find common words and phrases. The abstracts are grouped by year. A keyword or keyphrase is recorded only if it occurs in more than one paper, so that terms unique to a single paper are excluded.
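The filtering rule above can be illustrated with a minimal sketch. This is not the code in mine-year.py: the function name `keywords_by_year`, the (abstract, year) column order, the single-word tokenization, and the choice to apply the "more than one paper" rule within each year are all assumptions made for illustration.

```python
import csv
import io
import re
from collections import Counter, defaultdict


def keywords_by_year(rows, blocklist=frozenset(), min_papers=2):
    """Count, per year, how many abstracts contain each word.

    rows: iterable of (abstract, year) pairs (assumed column order).
    Only words found in at least `min_papers` abstracts of a year are kept.
    """
    doc_freq = defaultdict(Counter)
    for abstract, year in rows:
        # A set ensures each word is counted at most once per paper.
        words = set(re.findall(r"[a-z]+", abstract.lower())) - set(blocklist)
        doc_freq[year].update(words)
    return {
        year: Counter({w: n for w, n in counts.items() if n >= min_papers})
        for year, counts in doc_freq.items()
    }


# Tiny in-memory stand-in for abstracts.csv (contents are made up).
sample = io.StringIO(
    "We study lattice proofs,2020\n"
    "Lattice signatures are proofs,2020\n"
    "Hash based proofs,2021\n"
)
result = keywords_by_year(csv.reader(sample), blocklist={"we", "are"})
print(result["2020"])  # words shared by both 2020 abstracts
print(result["2021"])  # empty: only one 2021 abstract in the sample
```

Counting papers rather than raw occurrences is what keeps a word repeated many times in a single abstract from being recorded.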

Repository Architecture

Important Folders

paper-sorted contains the curated CSVs that were used to derive certain figures in the paper.

unsorted-sanitized contains the CSVs generated by the Python script, which sanitizes the abstracts.csv file in order to perform frequency analysis.

Important Files

abstracts.csv is a CSV containing only each abstract and its year. It is the input to the Python script.

article-info.csv is a CSV containing authorship information and other metadata for the articles whose abstracts we mined. All of these abstracts are publicly available. We thank the authors and publishers of these articles for making them so.

blocklist.txt is a newline-delimited file containing words that should be excluded from the frequency analysis.

mine-year.py is the main Python script. It takes no arguments or options.
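As an illustration of how a newline-delimited blocklist such as blocklist.txt might be applied, here is a minimal sketch. The helper names `load_blocklist` and `filter_words`, the case-insensitive matching, and the skipping of blank lines are assumptions and are not taken from mine-year.py. A temporary file stands in for the real blocklist.

```python
import tempfile
from pathlib import Path


def load_blocklist(path):
    """Read a newline-delimited file into a set of lowercase words."""
    text = Path(path).read_text(encoding="utf-8")
    # Blank lines are ignored (an assumption about the file format).
    return {line.strip().lower() for line in text.splitlines() if line.strip()}


def filter_words(words, blocklist):
    """Drop every word that appears in the blocklist (case-insensitive)."""
    return [w for w in words if w.lower() not in blocklist]


# Write a throwaway blocklist so the example runs without the repository.
with tempfile.TemporaryDirectory() as tmp:
    demo = Path(tmp) / "blocklist.txt"
    demo.write_text("the\nof\n\nand\n", encoding="utf-8")
    blocked = load_blocklist(demo)

kept = filter_words(["the", "security", "of", "lattices"], blocked)
print(kept)
```

Loading the blocklist into a set keeps each membership check constant-time, which matters little here but scales to larger word lists.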

Usage

python3 mine-year.py

Procedures used

To derive the results located in the 'unsorted-sanitized' folder, run the script.

The CSVs provided in the 'paper-sorted' folder received additional processing. To create these files, we accounted for plural and alternate forms of words by adding the counts of these forms to the count of the root word.
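The merging of plural and alternate forms can be sketched as follows. The README does not say whether this step was done by hand or by a script, so the mapping `VARIANT_TO_ROOT`, the function `merge_variants`, and the sample counts below are hypothetical and only illustrate the idea.

```python
from collections import Counter

# Hypothetical, hand-written mapping from a variant form to its root word.
VARIANT_TO_ROOT = {
    "proofs": "proof",
    "lattices": "lattice",
}


def merge_variants(counts, variant_to_root):
    """Add the count of each variant form to the count of its root word."""
    merged = Counter()
    for word, n in counts.items():
        # Words with no mapping entry are kept under their own name.
        merged[variant_to_root.get(word, word)] += n
    return merged


# Made-up counts standing in for one year's frequency table.
raw = Counter({"proof": 5, "proofs": 3, "lattice": 2, "lattices": 4, "hash": 1})
merged = merge_variants(raw, VARIANT_TO_ROOT)
print(merged)
```

An explicit mapping is used here instead of an automatic stemmer so that every merge stays visible and reviewable, which suits a curated dataset.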

Owner

  • Login: pws1453
  • Kind: user

Citation (CITATION.cff)

cff-version: 1.2.0
title: >-
  Trend Analyzing Algorithm designed alongside "A
  Survey of Zero-Knowledge Proofs in a Post-Quantum
  Context"
message: >-
  If you use this software as part of a software
  project, or to guide a research project, please
  cite it as below.
type: software
authors:
  - given-names: Preston
    family-names: Sergent
  - given-names: Bryan
    family-names: Richez
  - given-names: Alexa
    family-names: Krempa
  - given-names: Sam
    family-names: Benoist
  - given-names: Jay
    family-names: Shah
identifiers:
  - type: doi
    value: 10.5281/zenodo.6510301
  - type: url
    value: 'https://github.com/pws1453/pqc-paper-repo'

GitHub Events

Total
  • Push event: 1
  • Create event: 1
Last Year
  • Push event: 1
  • Create event: 1