masked-absa

Masking The Bias : From Echo Chambers to Large Scale Aspect-Based Sentiment Analysis

https://github.com/tweetpie/masked-absa

Science Score: 57.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
✓
DOI references
Found 7 DOI reference(s) in README
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (11.7%) to scientific vocabulary

Keywords

absa aspect-based-sentiment-analysis masked-absa natural-language-processing

Last synced: 11 months ago · JSON representation ·

Repository

Masking The Bias : From Echo Chambers to Large Scale Aspect-Based Sentiment Analysis

Basic Info

Host: GitHub
Owner: tweetpie
Default Branch: main
Homepage:
Size: 1.94 MB

Statistics

Stars: 2
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 0

Topics

absa aspect-based-sentiment-analysis masked-absa natural-language-processing

Created about 2 years ago · Last pushed over 1 year ago

Metadata Files

Readme Citation

Masking The Bias : From Echo Chambers to Large Scale Aspect-Based Sentiment Analysis

https://dx.doi.org/10.1007/978-3-031-78538-2_19

Welcome to the Masked Aspect-Based Sentiment Analysis repository. This project addresses the challenges of scarcity and high costs associated with manual annotation in Aspect-Based Sentiment Analysis (ABSA) by utilizing weak supervision methods. We have demonstrated the superiority of our model using SemEval datasets and real-world tweet data.

Masked Aspect Sentiment Classification (MASC)

Aspect-based stance detection identifies the stance towards each aspect term in a sentence.

sentence

In Aspect Term Sentiment Classification, current models often overemphasize the presence of aspect terms and the majority sentiment associated with them, ignoring the broader sentence context. This issue is particularly problematic with real-world datasets that have skewed or missing labels for specific aspects, causing models to memorize and repeatedly predict the same sentiment for certain words. The MaskedABSA method addresses these issues by replacing aspect terms with a placeholder token "[MASK]."

MASC

Weak Supervision Method

US_races

An effective strategy to address the scarcity of ABSA datasets and bias involves using weak supervision with social network structures, like retweet networks. By analyzing retweet patterns, sentiment and aspect orientations within specific groups can be inferred without detailed manual annotation. This approach creates large-scale, weakly labeled datasets that can be refined with machine learning and validated through targeted reviews. A well-defined codebook enhances the efficiency and consistency of labeling by providing guidelines for detecting stances based on community orientations. This method reduces the costs of creating labeled datasets and captures a wide range of aspects and sentiments across diverse groups and sub-communities.

Datasets

The study incorporates following datasets :

SemEval dataset

SemEval 2014, 2015 and 2016 Stance detection dataset ||Laptop14|Restaurant 14|Restaurant 15|Restaurant16| |:--------:|:--------:|:--------:|:--------:|:--------:| |MaskedABSA|86.24|87.65|91.53|94.83|

US Race Relations Datase

Race and Politics datasets distribution |Dataset|Positive|Negative|Total| |:----:|:----:|:----:|:----:| |Races|Train|885|2115|3000| | |Test|309|699|1000| |Politics|Train|873|2122|2995| | |Test|257|751|1008| * politic_dataset.zip : Twitter text data related to the political terms. * race_dataset.zip : Twitter text data related to the races terms.

Models

The models used in this study are available on Huggingface.

Contributions and Feedback

We encourage contributions and feedback to improve this project. If you have suggestions or want to contribute, please open an issue or pull request on our GitHub repository.

Citation

```bibtex @inproceedings{maskedabsa2025, title={Masking the Bias: From Echo Chambers to Large Scale Aspect-Based Sentiment Analysis}, author={Lee, Yeonjung and {\c{C}}etinkaya, Yusuf M{\"u}cahit and K{\"u}lah, Emre and Toroslu, {.I}smail Hakk{\i} and Davulcu, Hasan}, booktitle="Social Networks Analysis and Mining (ASONAM 2024)", year="2025", publisher="Springer Nature Switzerland", pages="214--225", doi="10.1007/978-3-031-78538-2_19" }

Owner

Login: tweetpie
Kind: user

Repositories: 1
Profile: https://github.com/tweetpie

Citation (CITATION.cff)

@inproceedings{ title={Masking The Bias : From Echo Chambers to Large Scale Aspect-Based Sentiment Analysis}, 
author={Yeonjung Lee, Yusuf \c{C}etinkaya, Emre K"{u}lah, Hakk{\i} Toroslu and Hasan Davulcu}, 
booktitle={}, 
pages={}, 
year={}, 
organization={} }

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science

masked-absa

Science Score: 57.0%

Keywords

Repository

Basic Info

Statistics

Topics

Metadata Files

README.md

Masking The Bias : From Echo Chambers to Large Scale Aspect-Based Sentiment Analysis

Masked Aspect Sentiment Classification (MASC)

Weak Supervision Method

Datasets

SemEval dataset

US Race Relations Datase

Models

Contributions and Feedback

Citation

Owner

Citation (CITATION.cff)

GitHub Events

Total

Last Year