project-implementation

COURT framework test implementation and materials. Developed during ISWS 2024 by team Hufflepuff.

https://github.com/isws-hufflepuff/project-implementation

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (17.0%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

COURT framework test implementation and materials. Developed during ISWS 2024 by team Hufflepuff.

Basic Info
  • Host: GitHub
  • Owner: isws-hufflepuff
  • Language: Jupyter Notebook
  • Default Branch: main
  • Homepage:
  • Size: 2.85 MB
Statistics
  • Stars: 0
  • Watchers: 0
  • Forks: 0
  • Open Issues: 0
  • Releases: 1
Created over 1 year ago · Last pushed over 1 year ago
Metadata Files
Readme Citation

README.md

README

This repository contains the code and resources of the ISWS 2024 Hufflepuff experiments described in the “Straight to COURT!“: COnstructing a generalizable framework for probing the ethical boUndaries of laRge language models across mulTiple domains” report.

Repository structure

  • /implemented-ex-material/copyright: Data (prompts and sucessive steps outputs) used and provide for the semi-automatic implementation of the project, focus on the copyright use case.
  • /notebooks: Contains the main code to run LLMs.
    • /notebooks/help: Contains some tutorial notebooks to use Ollama and to run Ollama with Google Colab (small LLMs only).
  • /evaluation-protocols: Contains sheets describing the tested prompts and returned answers for the different use cases (see the README file inside this folder).
  • /knowledge-graph : Contains ontology and use cases implementation
  • /report.zip : Contains latex source code of the report
  • ISWS2024_Hufflepuff.pdf : Report
  • README.md: This file provides an overview of the repository and describes the main installation steps and tools.

Installation

Static Badge

Ollama server

The following commands are executed in terminal * Install Ollama (choose the relevant OS). For Linux : bash curl -fsSL https://ollama.com/install.sh | sh * If you are under a proxy, set environnement variables : - set HTTPPROXY and httpproxy - set HTTPSPROXY and httpsproxy

  • To run Ollama (default HOST is 127.0.0.1:11434) bash ollama serve
  • To download a model bash ollama pull MODEL-NAME
  • To see the list of downloaded models bash ollama list
  • To run a model in terminal (see Ollama Python to run a model in a Python script) bash ollama run MODEL-NAME ### Ollama Python Static Badge

Ollama can also be run using a Python library. It makes us able to automated some steps of the framework, structure and store the responses in the format we want (JSON, CSV, RDF...).

Models in Ollama

Only Llama3:7B model has been tested in the automated implementation of our work.

| Model Name in Ollama | Developped by | Size |Number of parameters|Censored ?| |------------|-------------|---------|-------------|---------| | llama3:7b|Meta|4.7GB|8.03b|Yes & No| | mistral |Mistral AI|4.1GB|7.25b|No| | gemma:7b |Google|5GB|8.54b|Yes|

Running notebooks

Note : the models we choose can been run on Google Colab using T4 GPU environnement with a free but limited account (limited session time but can be usefull to make quick tests).

Owner

  • Name: isws-hufflepuff
  • Login: isws-hufflepuff
  • Kind: organization

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
title: "Straight to COURT! COnstructing a generalizable framework for probing the ethical boUndaries of laRge language models across mulTiple domains."
authors:
  - family-names: "Ondraszek"
    given-names: "Sarah Rebecca"
  - family-names: "Tual"
    given-names: "Solenn"
  - family-names: "Contesotto"
    given-names: "Ilaria"
  - family-names: "Cuccarini"
    given-names: "Marco"
  - family-names: "Kompuš"
    given-names: "Patrik"
  - family-names: "De Giorgis"
    given-names: "Stefano"
  - family-names: "Kirrane"
    given-names: "Sabrina"
version: 1.0.0
date-released: 2024-06-14
keywords:
  - LLMs
  - Knowledge Graphs
  - Privacy
  - Copyright
  - Biais

GitHub Events

Total
Last Year