https://github.com/amazon-science/real-world-noisy-benchmarks-for-natural-language-understanding

Benchmark test sets for real-world noise phenomena in goal-directed conversations in English.

https://github.com/amazon-science/real-world-noisy-benchmarks-for-natural-language-understanding

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 1 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (7.0%) to scientific vocabulary

Keywords

dataset natural-language-processing natural-language-understanding
Last synced: 10 months ago · JSON representation

Repository

Benchmark test sets for real-world noise phenomena in goal-directed conversations in English.

Basic Info
  • Host: GitHub
  • Owner: amazon-science
  • License: other
  • Default Branch: main
  • Homepage:
  • Size: 133 KB
Statistics
  • Stars: 3
  • Watchers: 8
  • Forks: 1
  • Open Issues: 0
  • Releases: 0
Topics
dataset natural-language-processing natural-language-understanding
Created about 5 years ago · Last pushed almost 4 years ago
Metadata Files
Readme Contributing License

README.md

Real World Noise Benchmarks for Natural Language Understanding

Project

If you use this dataset, please cite the following paper: @inproceedings{sengupta-etal-2021-robustness, title = "On the Robustness of Intent Classification and Slot Labeling in Goal-oriented Dialog Systems to Real-world Noise", author = "Sengupta, Sailik and Krone, Jason and Mansour, Saab", booktitle = "Proceedings of the 3rd Workshop on Natural Language Processing for Conversational AI", month = nov, year = "2021", address = "Online", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/2021.nlp4convai-1.7", doi = "10.18653/v1/2021.nlp4convai-1.7", pages = "68--79" }

The directory structure is as follows:

- data - atis - abbreviations - treatment_test.tsv - allcaps - treatment_test.tsv - misspellings - treatment_test.tsv - morphological - treatment_test.tsv - paraphrase - treatment_test.tsv - punctuation - treatment_test.tsv - synonyms - treatment_test.tsv - snips - abbreviations - treatment_test.tsv - allcaps - treatment_test.tsv - misspellings - treatment_test.tsv - morphological - treatment_test.tsv - paraphrase - treatment_test.tsv - synonyms - treatment_test.tsv

Security

See CONTRIBUTING for more information.

License Summary

The documentation is made available under the Creative Commons Attribution-NonCommercial 4.0 International License. See the LICENSE file.

Owner

  • Name: Amazon Science
  • Login: amazon-science
  • Kind: organization

GitHub Events

Total
Last Year

Issues and Pull Requests

Last synced: over 1 year ago

All Time
  • Total issues: 0
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 0
  • Total pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels