nereval-braziliancorporatetranscripts
https://github.com/rsabilio/nereval-braziliancorporatetranscripts
Science Score: 57.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 6 DOI reference(s) in README -
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (5.5%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: rsabilio
- License: mit
- Language: Jupyter Notebook
- Default Branch: main
- Size: 82.3 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
Evaluating Named Entity Recognition: A Comparative Analysis of Mono- and Multilingual Transformer Models on a Novel Brazilian Corporate Earnings Call Transcripts Dataset
This repository includes a dataset comprising 384 earnings call transcripts from Brazilian banks, along with the accompanying notebooks for preprocessing the transcripts. These notebooks are tailored for fine-tuning BERT- and T5-based models specifically for financial Named Entity Recognition (NER).
Additionally, the pre-print available at ArXiv provides a detailed overview of the comprehensive process involved in fine-tuning and comparing the BERTimbau, mBERT, PTT5, and mT5 models for financial NER.
The paper was published in the Applied Soft Computing journal. Please cite this work as follows:
Citation
```bibtex
@article{AbilioEvaluatingNamedEntity2024, author = {Ramon Abilio and Guilherme Palermo Coelho and Ana Estela Antunes {da Silva}}, title = {Evaluating Named Entity Recognition: A comparative analysis of mono- and multilingual transformer models on a novel Brazilian corporate earnings call transcripts dataset}, journal = {Applied Soft Computing}, pages = {112158}, year = {2024}, doi = {https://doi.org/10.1016/j.asoc.2024.112158}, }
Owner
- Name: Ramon Abilio
- Login: rsabilio
- Kind: user
- Company: IFSP - Campus Boituva
- Repositories: 1
- Profile: https://github.com/rsabilio
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this dataset or research, please cite it as below."
authors:
- family-names: "Abilio"
given-names: "Ramon"
- family-names: "Coelho"
given-names: "Guilherme Palermo"
- family-names: "Antunes da Silva"
given-names: "Ana Estela"
title: "Dataset and Research on Named Entity Recognition in Brazilian Corporate Earnings Call Transcripts"
version: "1.0.0"
date-released: 2024-01-01
doi: "10.1016/j.asoc.2024.112158"
url: "https://www.sciencedirect.com/science/article/pii/S1568494624009323"
preferred-citation:
type: article
authors:
- family-names: "Abilio"
given-names: "Ramon"
- family-names: "Coelho"
given-names: "Guilherme Palermo"
- family-names: "Antunes da Silva"
given-names: "Ana Estela"
title: "Evaluating Named Entity Recognition: A comparative analysis of mono- and multilingual transformer models on a novel Brazilian corporate earnings call transcripts dataset"
journal: "Applied Soft Computing"
volume:
issue:
pages: "112158"
year: 2024
doi: "10.1016/j.asoc.2024.112158"
GitHub Events
Total
- Push event: 1
Last Year
- Push event: 1