Science Score: 57.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 6 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (5.5%) to scientific vocabulary
Last synced: 10 months ago · JSON representation ·

Repository

Basic Info
  • Host: GitHub
  • Owner: rsabilio
  • License: mit
  • Language: Jupyter Notebook
  • Default Branch: main
  • Size: 82.3 MB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created over 2 years ago · Last pushed over 1 year ago
Metadata Files
Readme License Citation

README.md

Evaluating Named Entity Recognition: A Comparative Analysis of Mono- and Multilingual Transformer Models on a Novel Brazilian Corporate Earnings Call Transcripts Dataset

This repository includes a dataset comprising 384 earnings call transcripts from Brazilian banks, along with the accompanying notebooks for preprocessing the transcripts. These notebooks are tailored for fine-tuning BERT- and T5-based models specifically for financial Named Entity Recognition (NER).

Additionally, the pre-print available at ArXiv provides a detailed overview of the comprehensive process involved in fine-tuning and comparing the BERTimbau, mBERT, PTT5, and mT5 models for financial NER.

The paper was published in the Applied Soft Computing journal. Please cite this work as follows:

Citation

```bibtex

@article{AbilioEvaluatingNamedEntity2024, author = {Ramon Abilio and Guilherme Palermo Coelho and Ana Estela Antunes {da Silva}}, title = {Evaluating Named Entity Recognition: A comparative analysis of mono- and multilingual transformer models on a novel Brazilian corporate earnings call transcripts dataset}, journal = {Applied Soft Computing}, pages = {112158}, year = {2024}, doi = {https://doi.org/10.1016/j.asoc.2024.112158}, }

Owner

  • Name: Ramon Abilio
  • Login: rsabilio
  • Kind: user
  • Company: IFSP - Campus Boituva

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this dataset or research, please cite it as below."
authors:
  - family-names: "Abilio"
    given-names: "Ramon"
  - family-names: "Coelho"
    given-names: "Guilherme Palermo"
  - family-names: "Antunes da Silva"
    given-names: "Ana Estela"
title: "Dataset and Research on Named Entity Recognition in Brazilian Corporate Earnings Call Transcripts"
version: "1.0.0"
date-released: 2024-01-01
doi: "10.1016/j.asoc.2024.112158"
url: "https://www.sciencedirect.com/science/article/pii/S1568494624009323"
preferred-citation:
  type: article
  authors:
    - family-names: "Abilio"
      given-names: "Ramon"
    - family-names: "Coelho"
      given-names: "Guilherme Palermo"
    - family-names: "Antunes da Silva"
      given-names: "Ana Estela"
  title: "Evaluating Named Entity Recognition: A comparative analysis of mono- and multilingual transformer models on a novel Brazilian corporate earnings call transcripts dataset"
  journal: "Applied Soft Computing"
  volume: 
  issue: 
  pages: "112158"
  year: 2024
  doi: "10.1016/j.asoc.2024.112158"

GitHub Events

Total
  • Push event: 1
Last Year
  • Push event: 1