https://github.com/alan-turing-institute/election-ai-safety

https://github.com/alan-turing-institute/election-ai-safety

Science Score: 23.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: arxiv.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.5%) to scientific vocabulary
Last synced: 7 months ago · JSON representation

Repository

Basic Info
  • Host: GitHub
  • Owner: alan-turing-institute
  • License: mit
  • Language: Jupyter Notebook
  • Default Branch: main
  • Size: 5.6 MB
Statistics
  • Stars: 0
  • Watchers: 2
  • Forks: 1
  • Open Issues: 0
  • Releases: 0
Created over 1 year ago · Last pushed over 1 year ago
Metadata Files
Readme License

README.md

LLM Election Disinfo Paper

Repo for "Large language models can consistently generate high-quality content for election disinformation operations."

This study uses prompto.

Diagram

📏 DisElect Eval

Each folder within data/evals (voting/: DisElect.VT, mps/: DisElect.MP , baseline/: DisElect.BL) contains templates and variables (data/evals/*/variables.json) used to construct sets of prompts (data/evals/*/prompts.csv), files for input to prompto (data/evals/*/eval.jsonl), and results (data/evals/*/results.csv) - results contain only judgements and not full model responses.

src/make_evals.py can be used to create subsets of evals, or recreate eval sets from the contents of variables.json, data/evals/models.csv, and data/evals/params.json.

Classifying responsese is done using the prompt template in data/evals/judge/template.txt on GPT-3.5 Turbo. src/evals/judge.py contains a lightweight judge implementation (we again use prompto for running batches of judgement prompts).

Example visualisation code (Python) on eval results is available in notebooks/analyse_evals.ipynb.

🤖 Experiments

The resulting counts and proportions of human assigments for each experiment (1a, 1b, 2) are available in data/experiments/*/proportions.csv.

Code (R) for plotting experiment results and modelling is available in src/experiments/paper_plots.R and src/experiments/regression.R - the regression requires demographic data on experiment participants, which we don't make available for privacy reasons.

Models:

Available in data/models/csv

| Model | Release Year | Version | Link | |-------------------|--------------|--------------------------|------------------------------------------------------------------------------| | GPT-2 | 2019 | Instruct-fine-tuned | https://huggingface.co/vicgalle/gpt2-open-instruct-v1 | | T5 | 2020 | XL (2.85B) | https://huggingface.co/google/t5-v1_1-xl | | GPT-Neo | 2021 | 2.7B | https://huggingface.co/EleutherAI/gpt-neo-2.7B | | Flan-T5 | 2022 | XL (2.85B) | https://huggingface.co/google/flan-t5-xl | | GPT-3.5 (t-d-003) | 2022 | davinci-003 | n/a | | GPT-3.5 Turbo | 2023 | gpt-3.5-turbo | https://platform.openai.com/docs/models/gpt-3-5-turbo | | GPT-4 | 2023 | gpt-4-0613 | https://platform.openai.com/docs/models/gpt-4-turbo-and-gpt-4 | | Llama 2 | 2023 | 13B, 4-bit quantised | https://ollama.com/library/llama2:13b | | Mistral | 2023 | 7B, 4-bit quantised | https://ollama.com/library/mistral:7b | | Gemini 1.0 Pro | 2023 | gemini-1.0-pro-002 | https://cloud.google.com/vertex-ai/generative-ai/docs/model-reference/gemini | | Phi-2 | 2023 | 2.7B, 4-bit quantised | https://ollama.com/library/phi:2.7b | | Gemma | 2024 | v1.1 7B, 4-bit quantised | https://ollama.com/library/gemma:v1.1 | | LLama 3 | 2024 | 70B, 4-bit quantised | https://ollama.com/library/llama3:70b |

Owner

  • Name: The Alan Turing Institute
  • Login: alan-turing-institute
  • Kind: organization
  • Email: info@turing.ac.uk

The UK's national institute for data science and artificial intelligence.

GitHub Events

Total
Last Year

Issues and Pull Requests

Last synced: about 1 year ago

All Time
  • Total issues: 0
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 0
  • Total pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels

Dependencies

pyproject.toml pypi
  • ipykernel ^6.29.4
  • jsonlines ^4.0.0
  • levenshtein ^0.25.1
  • matplotlib ^3.9.0
  • nbformat ^5.10.4
  • numpy ^2.0.0
  • openai ^1.35.7
  • pandas ^2.2.2
  • python ^3.10
  • quart ^0.19.6
  • scikit-learn ^1.5.0
  • seaborn ^0.13.2
  • spacy ^3.7.5
  • tiktoken ^0.7.0
  • torch ^2.3.0
  • transformers ^4.40.1