graphandllmbasedredundancyanalysis
https://github.com/hofmannl/graphandllmbasedredundancyanalysis
Science Score: 54.0%
This score indicates how likely this project is to be science-related, based on various indicators:
- ✓ CITATION.cff file: found
- ✓ codemeta.json file: found
- ✓ .zenodo.json file: found
- ○ DOI references
- ✓ Academic publication links: links to zenodo.org
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: low similarity (13.2%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: Hofmannl
- Language: Jupyter Notebook
- Default Branch: main
- Size: 33.7 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 1
- Releases: 0
Metadata Files
README.md
Abstract
User stories (USs) are a widely used notation for requirements in agile software development. In large software projects, redundancies between USs can easily occur, and unresolved redundancies can impact software quality. While some redundancy may be acceptable, it is crucial for requirements engineers to know where redundancies occur in their projects. To address this, we want to provide automated approaches to analyse sets of USs in order to prevent a decrease of software quality due to the realisation of redundant USs. This paper presents two approaches to redundancy analysis. The first approach is based on a conceptual model for USs, called annotation graph, containing their main actions and entities; the other uses large language models (LLMs). We present the concepts and tools of both approaches and evaluate their potential and limitations by applying them to a large corpus of USs. Our results show that the graph-based approach can correctly analyse a strict form of redundancy, while the inherently fuzzy LLM-based approach is more adept at recognising semantic redundancies. Thus, this study contributes to the advancement of automated quality control of USs.
Redundancy Detection with LLMs
User stories are arranged into pairs, and each pair is then checked for redundancy.
We use the user stories from nlp-stories and the user story repo. We compared our LLM-based approach against a formal graph-based approach.
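The pairing step described above can be sketched as follows; the function name and the example stories are illustrative, not taken from the repository:

```python
from itertools import combinations

def pair_user_stories(stories):
    """Build all unordered pairs of user stories for pairwise redundancy checks."""
    return list(combinations(stories, 2))

stories = [
    "As a user, I want to log in so that I can access my account.",
    "As a user, I want to sign in so that I can reach my account.",
    "As an admin, I want to export reports.",
]
pairs = pair_user_stories(stories)
print(len(pairs))  # 3 stories yield C(3, 2) = 3 unordered pairs
```

Each pair is then passed to the graph-based or LLM-based analysis independently, so the number of checks grows quadratically with the size of the US set.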
Folder explanation
- Datasets: Contains the annotated nlp-stories
- ExamplesForPmc: Contains concrete examples of how the actual interaction (prompts) is written, based on the formal structure in Prompt Examples.
- resultsgptapproach
- Results for general redundancy analyses (Redundancy_Analyse.xlsx)
- Results for strict redundancy analyses (StrictRedundancyAnalyse.xlsx)
- Results for ignored USs and time consumption (found in the outputdetectionshighlighting_*.xlsx files)
- resultsgraphapproach
- Results of the strict redundancy analyses for G19 in JSON + CSV format
- Results of the strict redundancy analyses for G22 in JSON + CSV format
- srcgptapproach: contains the source code of our implementation. For installation, please refer to our installation and dependency description.
- [ ] /controller: Implementation of the Flask API endpoint.
- [ ] /future_work: Contains future experiments in jupyter notebook form (partly done).
- [X] /prompt_structure : Contains the prompts used for our experiments.
- [X] /results: All results achieved during our experiments.
- [X] /support_functions: As our implementation is based on a functional approach this src folder contains all abstracted functions.
- [X] /supportfunctionstest: Tests for the support functions (pythonic style to organise tests).
- [X] /utils: Utility functions.
- [X] /Various Jupyter Notebooks: Connect to and execute our program logic.
- [X] /setup.py
- srcgraphapproach:
- Implementation of the graph-based strict redundancy analysis (found in main.ipynb)
- ExperimentsWordSimilarity: Use of distance metrics to determine the similarity of words (annotations)
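The word-similarity experiments compare annotation words with string distance metrics. As a minimal sketch of the idea, assuming nothing about the repository's actual metric choice, a normalised similarity can be computed with the standard library:

```python
from difflib import SequenceMatcher

def word_similarity(a: str, b: str) -> float:
    """Normalised similarity in [0, 1] between two annotation words,
    based on difflib's ratio (related to longest matching subsequences)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()

# Surface-similar annotations score high, unrelated ones score low.
print(word_similarity("login", "log-in"))
print(word_similarity("user", "customer"))
```

In practice a library metric such as Levenshtein or Jaro-Winkler (as provided by the listed java-string-similarity and commons-text dependencies) would be swapped in; the repository's exact metric is not reproduced here.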
Architecture
Workflow
IDE + Plugins
- VSCode
- Jupyter
- Jupyter Cell Tags
- Jupyter Keymap
- Pylint
- Pylance
- Python
- Python Debugger
- Python Environment Manager
Dependencies
- Listed in the requirements.txt
Installation guide
Installation for the GPT approach
- Install python == 3.12
- Create a .venv
- pip install -r requirements.txt
- pip install . in src
- configure jupyter notebook env
- start jupyter notebook in the srcgptapproach folder
Installation for the Graph approach
- Install python == 3.12
- Create a .venv
- pip install -r requirements.txt
- pip install . in src
- configure jupyter notebook env
- start jupyter notebook in src (not in the srcgraphapproach)
Start-Up Env
- For Windows, we have developed scripts to automate the installation and recompilation of dependencies.
- After completing the GPT Approach Installation, you can use the startenvgpt_approach.ps1 script to launch all services for the GPT approach. (Installation for the GPT approach) Note: Each time the script is executed, the local source code is freshly compiled, and dependencies are verified for the correct versions. If discrepancies are found, they are reinstalled automatically.
- Similarly, after completing the Graph Approach Installation, you can use the startenvgraph_approach.ps1 script to launch all services for the Graph approach. (Installation for the Graph approach) Note: As with the GPT approach, executing this script will recompile the local source code and ensure dependencies are up-to-date, reinstalling them if necessary.
- Currently, only Windows is supported. Other operating systems such as Linux, MacOS, and Unix are not yet supported. However, we plan to streamline the automated installation and recompilation process for these platforms in the future.
- Furthermore, we plan to develop a backend service for both strategies and containerize the application using Docker. This will enable semi-technical staff to easily use the containers and integrate them into their applications.
Env
You have to create a .env-file in the srcgptapproach folder. The following entries have to be considered:
- OPENAIAPIKEY (Enter your API key here)
- MODEL_VERSION (GPT 3.5 = gpt-3.5-turbo points to gpt-3.5-turbo-0125, GPT-4o = gpt-4o-2024-05-13)
- TPM (Tokens-Per-Minute)
- RPM (Requests-Per-Minute)
- TOKEN_DELTA (Approximation of the Request Size to avoid GPT "Deadlocks")
- TEMPERATURE (e.g. 0.2)
- OUTPUTEXCELNAMEWITHJUST_TEXT (Name of the .xlsx file storing the results of experiments where only text has been used)
- OUTPUTEXCELNAMEWITHANNOTATIONS (Name of the .xlsx file storing the results of experiments where only annotations have been used)
- OUTPUTEXCELNAMEWITHTEXTANDANNOTATIONS (Name of the .xlsx file storing the results of experiments where both annotations and text have been used)
- OUTPUTEXCELNAMEJUSTTEXTUAL (Name of the .xlsx file storing the results of experiments where only text has been used)
- OUTPUTEXCELSTRICTREDUNDANCYNAMEJUSTTEXTUAL (Name of the .xlsx file storing the results of experiments where only text has been used and the prompt was tuned for strict redundancy)
- OUTPUTEXCELNAMEWITHSTRICT_ANNOTATIONS (Name of the .xlsx file storing the results of experiments where only annotations have been used and the prompt was tuned for strict redundancy)
- THRESHOLD_REPAIR (Any number of repairs, e.g., "3")
- THREADING ("1" = ON, "2" = OFF)
- THREAD_MULTIPLICATOR (Any number for thread multiplication > 0)
- LIMIT ("-1" = None, "n" = any number)
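Putting the entries above together, a minimal `.env` sketch might look like this. All values are placeholders; the variable names follow the list above, and the output-file entries are truncated to two examples:

```
OPENAIAPIKEY=sk-...
MODEL_VERSION=gpt-4o-2024-05-13
TPM=30000
RPM=500
TOKEN_DELTA=500
TEMPERATURE=0.2
OUTPUTEXCELNAMEWITHJUST_TEXT=results_text_only.xlsx
OUTPUTEXCELNAMEWITHANNOTATIONS=results_annotations_only.xlsx
THRESHOLD_REPAIR=3
THREADING=2
THREAD_MULTIPLICATOR=1
LIMIT=-1
```

The file belongs in the srcgptapproach folder and should not be committed, since it contains the API key.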
Prompt Example
An example of a Prompt-Message-Chain (PMC) can be found in the ExamplesForPmc folder. This PMC follows the structure:
[ M_{Sequence} = P_{Context-Framing} + \sum_{i=1}^{n} P_{Example}^{i} + P_{Process-Request} + \sum_{j=1}^{\tau} \left( P_{Agent-Answer}^{j} + P_{Repair-Request}^{j} \right) ]
Where:
- ( n \in \mathbb{N} ) is the number of shots in a few-shot prompt.
- ( j \in \mathbb{N} ) is the number of repair attempts required to obtain a valid output, constrained by a preset repair threshold ( \tau ) that must not be exceeded.
The "+" operator indicates concatenation of strings, which includes insertion of a system simulation.
Additional Details
( P_{Context-Framing} ): Contains explanations and content for:
- The actor role,
- The redundancy definition (Def. 3),
- The output format.
( P_{Example}^{i} ): Consists of a US pair example containing an input and its corresponding redundancy output. These examples frame the LLM agent for the specific task. Here, ( n ) is the number of examples to insert.
( P_{Process-Request} ): A prompt instructing the agent to analyze a particular pair of USs and respond with a JSON string.
If an invalid response is generated, the prompt is re-sent with additional prompts:
- ( P_{Agent-Answer}^{j} ): Represents the invalid JSON response.
- ( P_{Repair-Request}^{j} ): A prompt describing why the JSON response does not match the expected schema.
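As a rough illustration, the chain above can be assembled as a list of chat messages with a bounded repair loop. The function names, message texts, and JSON schema here are hypothetical sketches, not the repository's actual prompts:

```python
import json

def build_pmc(context_framing, examples, process_request):
    """Assemble the initial PMC: context framing, n few-shot examples, process request."""
    messages = [{"role": "system", "content": context_framing}]
    for ex_input, ex_output in examples:  # the n few-shot examples
        messages.append({"role": "user", "content": ex_input})
        messages.append({"role": "assistant", "content": ex_output})
    messages.append({"role": "user", "content": process_request})
    return messages

def run_with_repairs(messages, ask_llm, tau=3):
    """Re-send with agent answer + repair request until the reply is valid JSON,
    giving up once the repair threshold tau is exceeded."""
    for _ in range(tau + 1):
        answer = ask_llm(messages)
        try:
            return json.loads(answer)  # valid output: stop
        except json.JSONDecodeError:
            messages.append({"role": "assistant", "content": answer})  # P_Agent-Answer
            messages.append({"role": "user",                           # P_Repair-Request
                             "content": "The reply was not valid JSON; answer again "
                                        "with a JSON object matching the schema."})
    return None  # repair threshold tau exceeded
```

Here `ask_llm` stands in for a wrapper around the chat-completion call; concatenating the message list corresponds to the "+" operator in the formula above.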
Contributors
This repository was created for 'Agile Development: Redundancy Analysis of User Stories with Graphs and Large Language Models', submitted to Requirements Engineering: Foundation for Software Quality (REFSQ) 2025.
Owner
- Name: Lukas Sebastian H.
- Login: Hofmannl
- Kind: user
- Website: https://www.uni-marburg.de/de/fb12/arbeitsgruppen/swt/team
- Repositories: 2
- Profile: https://github.com/Hofmannl
Citation (CITATION.cff)
cff-version: 1.0.0
message: |
If you use this software, please cite it as below and pay attention to the following requirements:
- Ensure proper attribution in any publications or presentations.
- Adhere to the licensing terms specified in the repository.
- Contact the authors for any commercial use or redistribution.
authors:
- family-names: Hofmann
given-names: Lukas Sebastian
orcid: https://orcid.org/0009-0004-3823-3894
- family-names: Lauer
given-names: Alexander
- family-names: Taentzer
given-names: Gabriele
title: "Agile Development in the Era of GenAI: Redundancy Analysis of User Stories with Formal Approaches and Large Language Models"
version: 1.0.0
date-released: 2024-07-05
GitHub Events
Total
- Issues event: 1
- Member event: 2
- Push event: 26
- Public event: 1
- Create event: 2
Last Year
- Issues event: 1
- Member event: 2
- Push event: 26
- Public event: 1
- Create event: 2
Dependencies
- python 3.6 build
- info.debatty:java-string-similarity 2.0.0
- org.apache.commons:commons-text 1.12.0
- org.json:json 20210307
- Django >=1.9
- Jinja2 >=2.9.5
- numpy >=1.16.2
- pandas >=0.24.2
- spacy >=2.1.2