masked-absa

Masking The Bias : From Echo Chambers to Large Scale Aspect-Based Sentiment Analysis

https://github.com/tweetpie/masked-absa

Science Score: 57.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 7 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.7%) to scientific vocabulary

Keywords

absa aspect-based-sentiment-analysis masked-absa natural-language-processing
Last synced: 9 months ago · JSON representation ·

Repository

Masking The Bias : From Echo Chambers to Large Scale Aspect-Based Sentiment Analysis

Basic Info
  • Host: GitHub
  • Owner: tweetpie
  • Default Branch: main
  • Homepage:
  • Size: 1.94 MB
Statistics
  • Stars: 2
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Topics
absa aspect-based-sentiment-analysis masked-absa natural-language-processing
Created about 2 years ago · Last pushed over 1 year ago
Metadata Files
Readme Citation

README.md

Masking The Bias : From Echo Chambers to Large Scale Aspect-Based Sentiment Analysis

https://dx.doi.org/10.1007/978-3-031-78538-2_19

Welcome to the Masked Aspect-Based Sentiment Analysis repository. This project addresses the challenges of scarcity and high costs associated with manual annotation in Aspect-Based Sentiment Analysis (ABSA) by utilizing weak supervision methods. We have demonstrated the superiority of our model using SemEval datasets and real-world tweet data.

Masked Aspect Sentiment Classification (MASC)

Aspect-based stance detection identifies the stance towards each aspect term in a sentence.

sentence

In Aspect Term Sentiment Classification, current models often overemphasize the presence of aspect terms and the majority sentiment associated with them, ignoring the broader sentence context. This issue is particularly problematic with real-world datasets that have skewed or missing labels for specific aspects, causing models to memorize and repeatedly predict the same sentiment for certain words. The MaskedABSA method addresses these issues by replacing aspect terms with a placeholder token "[MASK]."

MASC

Weak Supervision Method

US_races

An effective strategy to address the scarcity of ABSA datasets and bias involves using weak supervision with social network structures, like retweet networks. By analyzing retweet patterns, sentiment and aspect orientations within specific groups can be inferred without detailed manual annotation. This approach creates large-scale, weakly labeled datasets that can be refined with machine learning and validated through targeted reviews. A well-defined codebook enhances the efficiency and consistency of labeling by providing guidelines for detecting stances based on community orientations. This method reduces the costs of creating labeled datasets and captures a wide range of aspects and sentiments across diverse groups and sub-communities.

Datasets

The study incorporates following datasets :

SemEval dataset

SemEval 2014, 2015 and 2016 Stance detection dataset ||Laptop14|Restaurant 14|Restaurant 15|Restaurant16| |:--------:|:--------:|:--------:|:--------:|:--------:| |MaskedABSA|86.24|87.65|91.53|94.83|

US Race Relations Datase

Race and Politics datasets distribution |Dataset|Positive|Negative|Total| |:----:|:----:|:----:|:----:| |Races|Train|885|2115|3000| | |Test|309|699|1000| |Politics|Train|873|2122|2995| | |Test|257|751|1008| * politic_dataset.zip : Twitter text data related to the political terms. * race_dataset.zip : Twitter text data related to the races terms.

Models

The models used in this study are available on Huggingface.

|Dataset|Model| |:----:|:----:| |Restaurant14|masked-absa-rest14| |Laptop14|masked-absa-rest14| |Restaurant15|masked-absa-rest15| |Restaurant16|masked-absa-rest15| |Race|masked-absa-race| |Politics|masked-absa-politics|

Contributions and Feedback

We encourage contributions and feedback to improve this project. If you have suggestions or want to contribute, please open an issue or pull request on our GitHub repository.

Citation

```bibtex @inproceedings{maskedabsa2025, title={Masking the Bias: From Echo Chambers to Large Scale Aspect-Based Sentiment Analysis}, author={Lee, Yeonjung and {\c{C}}etinkaya, Yusuf M{\"u}cahit and K{\"u}lah, Emre and Toroslu, {.I}smail Hakk{\i} and Davulcu, Hasan}, booktitle="Social Networks Analysis and Mining (ASONAM 2024)", year="2025", publisher="Springer Nature Switzerland", pages="214--225", doi="10.1007/978-3-031-78538-2_19" }

Owner

  • Login: tweetpie
  • Kind: user

Citation (CITATION.cff)

@inproceedings{ title={Masking The Bias : From Echo Chambers to Large Scale Aspect-Based Sentiment Analysis}, 
author={Yeonjung Lee, Yusuf \c{C}etinkaya, Emre K"{u}lah, Hakk{\i} Toroslu and Hasan Davulcu}, 
booktitle={}, 
pages={}, 
year={}, 
organization={} }

GitHub Events

Total
  • Push event: 1
Last Year
  • Push event: 1