https://github.com/amazon-science/fq-bank

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
○
.zenodo.json file
○
DOI references
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (11.7%) to scientific vocabulary

Last synced: 10 months ago · JSON representation

Repository

Basic Info

Host: GitHub
Owner: amazon-science
License: other
Default Branch: main
Size: 6.25 MB

Statistics

Stars: 3
Watchers: 2
Forks: 1
Open Issues: 0
Releases: 0

Created over 3 years ago · Last pushed over 3 years ago

Metadata Files

Readme Contributing License

Follow-up Query Bank (FQ-Bank)

The Follow-up Query (FQ) Bank has been released as part of the paper Learning to Retrieve Engaging Follow-Up Queries accepted at EACL 2023.

Abstract

Open domain conversational agents can answer a broad range of targeted queries. However, the sequential nature of interaction with these systems makes knowledge exploration a lengthy task which burdens the user with asking a chain of well phrased questions. In this paper, we present a retrieval based system and associated dataset for predicting the next questions that the user might have. Such a system can proactively assist users in knowledge exploration leading to a more engaging dialog. The retrieval system is trained on a dataset which contains ~14K multi-turn information-seeking conversations with a valid follow-up question and a set of invalid candidates. The invalid candidates are generated to simulate various syntactic and semantic confounders such as paraphrases, partial entity match, irrelevant entity, and ASR errors. We use confounder specific techniques to simulate these negative examples on the OR-QuAC dataset and develop a dataset called the Follow-up Query Bank (FQ-Bank). Then, we train ranking models on FQ-Bank and present results comparing supervised and unsupervised approaches. The results suggest that we can retrieve the valid follow-ups by ranking them in higher positions compared to confounders, but further knowledge grounding can improve ranking performance.

System Diagram

Data Format

The train, dev, and test sets are provided in three JSON files. Each JSON file contains a list. Each item in the list contains the following keys:

id : A dictionary having two keys dialogue and turn.
current_utterance : The current utterance
current_response : Response to the current utterance
dialog_history : A list of turns before the current utterance
candidate_utterances : Two lists with the keys valid and invalid contains the valid and invalid follow-up queries. Each of the invalid utterances is accompanied by the the reason explaining why the utterance is not a valid follow-up.

Citation

bibtex @inproceedings{richardson-etal-2023-fqbank, title = "Learning to Retrieve Engaging Follow-Up Queries", author = "Richardson, Christopher and Kar, Sudipta and Kumar, Anjishnu and Ramachandran, Anand and Zia Khan, Omar and Raeesy, Zeynab and Sethy, Abhinav", booktitle = "Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics", year = "2023" }

Security

See CONTRIBUTING for more information.

License Summary

The data is made available under the Creative Commons Attribution-ShareAlike 4.0 International License. See the LICENSE file.

The sample code is made available under the MIT-0 license. See the LICENSE-SAMPLECODE file.

Owner

Name: Amazon Science
Login: amazon-science
Kind: organization

Website: https://amazon.science
Twitter: AmazonScience
Repositories: 80
Profile: https://github.com/amazon-science

GitHub Events

Total

Fork event: 1

Last Year

Fork event: 1

Issues and Pull Requests

Last synced: about 1 year ago

All Time

Total issues: 0
Total pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Total issue authors: 0
Total pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 0
Pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Issue authors: 0
Pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science