few-shot-learning-for-issue-report-classification

This repository contains the notebook for training and testing the classifiers for our participation in the tool competition organized in the scope of the 2st International Workshop on Natural Language-based Software Engineering.

https://github.com/collab-uniba/few-shot-learning-for-issue-report-classification

Science Score: 57.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (7.3%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

This repository contains the notebook for training and testing the classifiers for our participation in the tool competition organized in the scope of the 2st International Workshop on Natural Language-based Software Engineering.

Basic Info
  • Host: GitHub
  • Owner: collab-uniba
  • License: mit
  • Language: Jupyter Notebook
  • Default Branch: main
  • Homepage:
  • Size: 45.9 KB
Statistics
  • Stars: 3
  • Watchers: 4
  • Forks: 1
  • Open Issues: 0
  • Releases: 0
Created over 3 years ago · Last pushed almost 3 years ago
Metadata Files
Readme License Citation

README.md

Few-Shot Learning for Issue Report Classification

This repository contains notebooks for training and testing the classifiers for our participation in the tool competition organized in the scope of the 2st International Workshop on Natural Language-based Software Engineering.

Our fine-tuned SETFIT model is available on HuggingFace.

How to use

  • NLBSE23_1_Preprocessing.ipynb: Runs preprocessing and saves output in order to be used from the other notebooks.
  • NLBSE23_2_RoBERTa.ipynb: Trains and tests RoBERTa model in different settings
  • NLBSE23_3_SETFIT.ipynb: Trains and tests SETFIT model in different settings

It is essential to run the notebook NLBSE23_1_Preprocessing.ipynb and store the outputs, in order to be able to run the others, as it creates and preprocess the datasets.

It is recommended to run the RoBERTa and SETFIT notebooks with a GPU. For any problem in reproducing the experiments, feel free to open an issue or contact the authors.

How to cite

To cite the manually annotated dataset: @dataset{colavito_dataset_2023, title = {Few-Shot Learning for Issue Report Classification}, author = {Colavito, Giuseppe and Lanubile, Filippo and Novielli, Nicole}, year = 2023, publisher = {Zenodo}, doi = {10.5281/zenodo.7628150}, url = {https://doi.org/10.5281/zenodo.7628150} }

To cite this repository @software{colavito_code_2023, title = {{Few-Shot Learning for Issue Report Classification}}, author = {Colavito, Giuseppe and Lanubile, Filippo and Novielli, Nicole}, year = 2023, url = {https://github.com/collab-uniba/Issue-Report-Classification-NLBSE2023}, version = {1.0.0} }

To cite the paper: ``` @inproceedings{Colavito-2023, title = {Few-Shot Learning for Issue Report Classification}, author = {Colavito, Giuseppe and Lanubile, Filippo and Novielli, Nicole}, year = 2023, booktitle = {2nd International Workshop on Natural Language-Based Software Engineering (NLBSE)} }

```

Owner

  • Name: Collaborative Development Group
  • Login: collab-uniba
  • Kind: organization
  • Email: info@peopleware.ai
  • Location: University of Bari, Italy

As a research group we address challenges that must be overcome in collaborative environments, even if distributed by time or distance

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Colavito"
  given-names: "Giuseppe"
  orcid: "https://orcid.org/0000-0003-3871-401X"
- family-names: "Lanubile"
  given-names: "Filippo"
  orcid: "https://orcid.org/0000-0003-3373-7589"
- family-names: "Novielli"
  given-names: "Nicole"
  orcid: "https://orcid.org/0000-0003-1160-2608"
title: "Few-Shot Learning for Issue Report Classification"
version: 1.0.0
date-released: 2023-02-13
url: "https://github.com/collab-uniba/Issue-Report-Classification-NLBSE2023"

GitHub Events

Total
Last Year

Issues and Pull Requests

Last synced: 9 months ago

All Time
  • Total issues: 0
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 0
  • Total pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels