https://github.com/aryashah2k/nlp-data-augmentation

Implementing 5 Different Approaches To Augmenting Data For Natural Language Processing Tasks.

Science Score: 10.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
    1 of 1 committers (100.0%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (3.6%) to scientific vocabulary

Keywords

back-translation bert data-augmentation ensemble natural-language-processing t5-model text-to-text-transfer-transformer word-embeddings
Last synced: 5 months ago

Repository

Implementing 5 Different Approaches To Augmenting Data For Natural Language Processing Tasks.

Basic Info
  • Host: GitHub
  • Owner: aryashah2k
  • License: MIT
  • Language: Jupyter Notebook
  • Default Branch: main
  • Homepage:
  • Size: 2.51 MB
Statistics
  • Stars: 10
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Topics
back-translation bert data-augmentation ensemble natural-language-processing t5-model text-to-text-transfer-transformer word-embeddings
Created over 3 years ago · Last pushed over 3 years ago
Metadata Files
Readme License

README.md

Data Augmentation Techniques For Natural Language Processing


This repository covers five data augmentation techniques for natural language processing, each aimed at improving the performance of your models.

The five techniques included in this repository are:

| No. | Technique | Link To Folder |
|-----|-----------|----------------|
| 1. | Word Embeddings | Click Here |
| 2. | BERT | Click Here |
| 3. | Back Translation | Click Here |
| 4. | Text To Text Transformer | Click Here |
| 5. | Ensemble Approach | Click Here |
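
To give a rough idea of what the first technique in the table (word embeddings) can look like, here is a minimal sketch. It is not the code from this repository's notebooks. The toy vectors in `TOY_EMBEDDINGS` and the helper names `nearest_neighbor` and `augment` are hypothetical, chosen only for illustration; a real setup would presumably load pretrained vectors instead.

```python
import math
import random

# Toy 3-dimensional embeddings, invented for illustration only.
# A real pipeline would load pretrained word vectors instead.
TOY_EMBEDDINGS = {
    "good": (0.9, 0.1, 0.0),
    "great": (0.85, 0.15, 0.05),
    "bad": (-0.9, 0.1, 0.0),
    "movie": (0.0, 0.9, 0.1),
    "film": (0.05, 0.85, 0.15),
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def nearest_neighbor(word, embeddings):
    """Return the most similar other word, or None if the word is unknown."""
    if word not in embeddings:
        return None
    target = embeddings[word]
    candidates = [
        (cosine(target, vec), other)
        for other, vec in embeddings.items()
        if other != word
    ]
    return max(candidates)[1] if candidates else None


def augment(sentence, embeddings, replace_prob=0.5, rng=None):
    """Replace known words with their nearest neighbor with probability replace_prob."""
    rng = rng or random.Random()
    out = []
    for token in sentence.split():
        neighbor = nearest_neighbor(token.lower(), embeddings)
        if neighbor is not None and rng.random() < replace_prob:
            out.append(neighbor)
        else:
            # Unknown words, or words not selected for replacement, pass through.
            out.append(token)
    return " ".join(out)


print(augment("A good movie", TOY_EMBEDDINGS, replace_prob=1.0))
```

Words missing from the embedding table are left untouched, and `replace_prob` controls how aggressively the sentence is rewritten, so the same input can yield several augmented variants.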


Collective Comparison Of The 5 Approaches

--TO DO--


Completion Log:

[x] Word Embeddings

[-] BERT

[-] Back Translation

[-] Text To Text Transformer

[-] Ensemble Approach

Owner

  • Name: Arya Shah
  • Login: aryashah2k
  • Kind: user
  • Location: Mumbai, India
  • Company: IIT Gandhinagar

Artificial Intelligence Engineer & Researcher

Committers

Last synced: 8 months ago

All Time
  • Total Commits: 30
  • Total Committers: 1
  • Avg Commits per committer: 30.0
  • Development Distribution Score (DDS): 0.0
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Arya Shah a****2@n****n 30

Issues and Pull Requests

Last synced: 8 months ago

All Time
  • Total issues: 0
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 0
  • Total pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0