Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (8.2%) to scientific vocabulary
Keywords
Repository
AskDocs: A medical QA dataset
Basic Info
Statistics
- Stars: 5
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 3
Topics
Metadata Files
README.md
Getting Started
| | Train | Valid | Test | External | | ----- | ------ | ----- | ---- | -------- | | en | 24256 | 5198 | 5198 | 166804 | | pt | 24256 | 5198 | 5198 | 166804 |
The dataset questions and answers span a period from January 2013 to December 2019.
We additionally translated to Portuguese and used external data from here, which is a binary classification dataset "a QNLI medical-like". We adapted to value 5 or 0.
Usage
Datasets :hugs:
```python from datasets import load_dataset
data = loaddataset("ju-resplande/askD", split="trainpt")
['trainen', 'validationen', 'testen', 'externalen', 'trainpt', 'validationpt', 'testpt', 'externalpt']
```
Citing
bibtex
@misc{Gomes20202,
author = {GOMES, J. R. S.},
title = {AskDocs: A medical QA dataset},
year = {2020},
publisher = {GitHub},
journal = {GitHub repository},
howpublished = {\url{https://github.com/ju-resplande/askD}},
commit = {42060c4402c460e174cbb75a868b429c554ba2b7}
}
Acknowledgments
@viniciusplo and @ruanchaves for giving the idea. :smiley:
Owner
- Name: Juliana Resplande
- Login: ju-resplande
- Kind: user
- Location: Brazil
- Company: Federal University of Goiás
- Website: https://ju-resplande.github.io/
- Twitter: ju_resplande
- Repositories: 9
- Profile: https://github.com/ju-resplande
Computer science undergraduate focusing on NLP
Citation (CITATION.cff)
# This CITATION.cff file was generated with cffinit.
# Visit https://bit.ly/cffinit to generate yours today!
cff-version: 1.2.0
title: 'AskDocs: A medical QA dataset'
message: >-
If you use this dataset, please cite it using the
metadata from this file.
type: dataset
authors:
- given-names: Juliana
name-particle: R. S.
family-names: Gomes
affiliation: Federal University of Goiás
orcid: 'https://orcid.org/0000-0001-6900-1931'
email: julianarsg13@gmail.com