https://github.com/awslabs/recoe

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
○
.zenodo.json file
○
DOI references
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (6.4%) to scientific vocabulary

Last synced: 10 months ago · JSON representation

Repository

Basic Info

Host: GitHub
Owner: awslabs
License: other
Default Branch: main
Size: 2.48 MB

Statistics

Stars: 6
Watchers: 3
Forks: 1
Open Issues: 0
Releases: 0

Created over 1 year ago · Last pushed over 1 year ago

Metadata Files

Readme Contributing License Code of conduct

ReCoE: Reasoning-based Counterfactual Editing Dataset

ReCoE is a benchmark dataset designed to evaluate the effectiveness of knowledge editing techniques in language models, particularly focusing on the propagation of updated knowledge through various reasoning tasks. The dataset encompasses six common reasoning schemes: Superlative, Comparative, Sorting, Counting, Aggregation and Subtraction. Each subset contains questions that require the model to reason over interconnected facts, assessing its ability to integrate and propagate edited knowledge accurately.

Please refer to our paper for more details.

Dataset Structure

The dataset is organized into six sub-directories, each corresponding to one of the reasoning schemes.

data/superlative
data/comparative
data/sorting
data/counting
data/aggregation
data/subtraction

Each sub-directory contains a JSON file that includes questions, original answers (aliases) along with their supporting facts (identified by the keys choice_1_facts and choice_1_counterfactuals), as well as counterfactual answers (aliases) accompanied by their respective supporting counterfactuals (identified by the keys choice_2_facts and choice_2_counterfactuals).

An example from the Superlative category is shown below: json { "question": "Who is the last celebrity Brody Jenner had a romantic relationship with?", "answer": "Lauren Conrad", "counterfactual_answer": "Heidi Montag", "facts_per_choice": { "choice_1_facts": [ { "fact": "Lauren Conrad and Brody Jenner briefly dated in 2006.", "links": [ "https://en.wikipedia.org/wiki/Lauren_Conrad" ], "atomic_facts": [ "Lauren Conrad and Brody Jenner's dating was brief.", "Lauren Conrad and Brody Jenner's brief dating was in 2006." ], "atomic_triples": [ "(Lauren Conrad and Brody Jenner's dating; was; brief)", "(Lauren Conrad and Brody Jenner's brief dating; was in; 2006)" ] } ], "choice_2_facts": [ { "fact": "Heidi Montag was never romantically involved with Brody Jenner.", "links": [ "https://en.wikipedia.org/wiki/Heidi_Montag" ], "atomic_facts": [ "Heidi Montag was never romantically involved with Brody Jenner." ], "atomic_triples": [ "(Heidi Montag; was never romantically involved with; Brody Jenner)" ] } ] }, "counterfactuals_per_choice": { "choice_1_counterfactuals": [ { "fact": "Lauren Conrad was never romantically involved with Brody Jenner.", "atomic_facts": [ "Lauren Conrad was never romantically involved with Brody Jenner." ], "atomic_triples": [ "(Lauren Conrad; was never romantically involved with; Brody Jenner)" ] } ], "choice_2_counterfactuals": [ { "fact": "Heidi Montag and Brody Jenner briefly dated in 2006.", "atomic_facts": [ "Heidi Montag and Brody Jenner's dating was brief.", "Heidi Montag and Brody Jenner's brief dating was in 2006." ], "atomic_triples": [ "(Heidi Montag and Brody Jenner's dating; was; brief)", "(Heidi Montag and Brody Jenner's brief dating; was in; 2006)" ] } ] }, "answer_alias": ["Lauren K. Conrad", "Lauren Katherine Conrad", "L.C."], "counterfactual_answer_alias": ["Heidi Pratt", "Heidi Blair Montag", "Heidi B. Montag"] }

In the sorting subset, both the original and counterfactual answers consist of the same set of entities in different orders. The supporting facts correspond to each entity.

For the aggregation and subtraction subsets, we provide two versions of supporting facts (or counterfactuals): one set refers to direct facts, while the other pertains to indirect facts. In the paper, only the indirect facts are utilized for editing.

Dataset Statistics

| Reasoning Scheme | Examples | Atomic facts | |-------------|----------|--------------| | superlative | 1,172 | 7,374 | | comparative | 1,153 | 7,144 | | counting | 643 | 1,309 | | sorting | 1,034 | 6,009 | | aggregation | 508 | 1,822 | | subtraction | 500 | 1,500 | | Total | 5,010| 25,158 |

Citation

If you use ReCoE in your research, please cite the original paper: @inproceedings{hua-etal-2024-propagation, title = "Propagation and Pitfalls: Reasoning-based Assessment of Knowledge Editing through Counterfactual Tasks", author= "Hua, Wenyue and Guo, Jiang and Dong, Mingwen and Zhu, Henghui and Ng, Patrick and Wang, Zhiguo", booktitle = "Findings of the Association for Computational Linguistics: ACL 2024", month = aug, year = "2024", address = "Bangkok, Thailand", publisher = "Association for Computational Linguistics", pages = "12503--12525", }

License

This dataset is released under the CC-BY-NC-ND-4.0 license.

Owner

Name: Amazon Web Services - Labs
Login: awslabs
Kind: organization
Location: Seattle, WA

Website: http://amazon.com/aws/
Repositories: 914
Profile: https://github.com/awslabs

AWS Labs

GitHub Events

Total

Watch event: 5
Public event: 1
Fork event: 1

Last Year

Watch event: 5
Public event: 1
Fork event: 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science