quebec-insurance-rag-corpora

Quebec Automobile Insurance Question-Answering With Retrieval-Augmented Generation

https://github.com/graal-research/quebec-insurance-rag-corpora

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓ CITATION.cff file: Found CITATION.cff file
✓ codemeta.json file: Found codemeta.json file
✓ .zenodo.json file: Found .zenodo.json file
○ DOI references
○ Academic publication links
○ Academic email domains
○ Institutional organization owner
○ JOSS paper metadata
○ Scientific vocabulary similarity: Low similarity (8.0%) to scientific vocabulary

Last synced: 10 months ago

Repository

Basic Info

Host: GitHub
Owner: GRAAL-Research
Default Branch: main
Homepage:
Size: 299 KB

Statistics

Stars: 2
Watchers: 3
Forks: 0
Open Issues: 0
Releases: 0

Created over 1 year ago · Last pushed over 1 year ago

Metadata Files

Readme Citation

Quebec Automobile Insurance Question-Answering With Retrieval-Augmented Generation

About the Dataset

The dataset consists of two components: 1) the references corpus and 2) the question-answering (QnA) corpus.

References Corpus

It contains Quebec legislation and automobile insurance reference documents. For more details, see our article.

QnA Corpus

It contains a set of 82 automobile insurance questions collected from the Web. For more details, see our article.

To Cite

@inproceedings{beauchemin-etal-2024-quebec,
    title = "{Q}uebec Automobile Insurance Question-Answering With Retrieval-Augmented Generation",
    author = "Beauchemin, David and Khoury, Richard and Gagnon, Zachary",
    editor = "Aletras, Nikolaos and Chalkidis, Ilias and Barrett, Leslie and Goan{\textcommabelow{t}}{\u{a}}, C{\u{a}}t{\u{a}}lina and Preo{\textcommabelow{t}}iuc-Pietro, Daniel and Spanakis, Gerasimos",
    booktitle = "Proceedings of the Natural Legal Language Processing Workshop 2024",
    month = nov,
    year = "2024",
    address = "Miami, FL, USA",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.nllp-1.5",
    pages = "48--60",
    abstract = "Large Language Models (LLMs) perform outstandingly in various downstream tasks, and the use of the Retrieval-Augmented Generation (RAG) architecture has been shown to improve performance for legal question answering (Nuruzzaman and Hussain, 2020; Louis et al., 2024). However, there are limited applications in insurance questions-answering, a specific type of legal document. This paper introduces two corpora: the Quebec Automobile Insurance Expertise Reference Corpus and a set of 82 Expert Answers to Layperson Automobile Insurance Questions. Our study leverages both corpora to automatically and manually assess a GPT4-o, a state-of-the-art (SOTA) LLM, to answer Quebec automobile insurance questions. Our results demonstrate that, on average, using our expertise reference corpus generates better responses on both automatic and manual evaluation metrics. However, they also highlight that LLM QA is unreliable enough for mass utilization in critical areas. Indeed, our results show that between 5{\%} to 13{\%} of answered questions include a false statement that could lead to customer misunderstanding.",
}

Owner

Name: GRAAL/GRAIL
Login: GRAAL-Research
Kind: organization
Location: Québec, QC

Website: grail.ift.ulaval.ca
Repositories: 24
Profile: https://github.com/GRAAL-Research

Machine Learning Research Group - Université Laval

Citation (CITATION.cff)

cff-version: 1.2.0
preferred-citation:
  type: article
  message: "If you use this dataset, please cite it as below."
  authors:
  - family-names: "Beauchemin"
    given-names: "David"
  - family-names: "Saggion"
    given-names: "Horacio"
  - family-names: "Khoury"
    given-names: "Richard"
  title: "MeaningBERT: Assessing Meaning Preservation Between Sentences"
  url: "https://www.frontiersin.org/articles/10.3389/frai.2023.1223924"
  year: 2024
