repeal_the_8th_data
Anonymised Twitter data for the online discussion on the Irish abortion referendum of 2018.
Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 1 DOI reference(s) in README -
✓Academic publication links
Links to: arxiv.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (8.1%) to scientific vocabulary
Repository
Anonymised Twitter data for the online discussion on the Irish abortion referendum of 2018.
Basic Info
- Host: GitHub
- Owner: caroline-pena
- Default Branch: main
- Size: 35.6 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
Finding Polarised Communities and Tracking Information Diffusion on Twitter: A Network Approach on the Irish Abortion Referendum
This repository contains the data files used in the analysis for the paper Finding Polarised Communities and Tracking Information Diffusion on Twitter: A Network Approach on the Irish Abortion Referendum. The paper is published in the Royal Society Open Science.
File description
The file RT8_mentions_anonymised.csv contains every mention in the full dataset.
The file RT8_communities_anonymised.csv contains the community membership of users in the mutual mentions network.
The file RT8_cascade_anonymised.csv contains unique IDs to identify every same text in the full dataset.
The file RT8_tweet_ID.csv contains every tweet ID gathered for the study. This allows one to retrieve the original tweets using the Twitter API.
File structure
The file RT8_mentions_anonymised.csv contains three columns:
- from: the user (pseudo-anonymised) who sends the tweet.
- to: the user (pseudo-anonymised) who receives (is mentioned in) the tweet.
- sentiment: the sentiment score of the text in the tweet obtained with AFINN[1].
The file RT8_communities_anonymised.csv contains two columns:
- user: a user (pseudo-anonymised) in the mutual mentions network, as described in the paper. The user IDs here match the ones in RT8_mentions_anonymised.csv.
- membership: the user's community membership obtained with the weighted Louvain[2] algorithm.
The file RT8_cascade_anonymised.csv contains three columns:
- user: the user (pseudo-anonymised) that posts the tweet. The user IDs here match the ones in RT8_mentions_anonymised.csv.
- created_at: the date and time when the tweet was posted.
- retweet_id: the ID of the text (equal texts contain same ID).
The file RT8_tweet_ID.csv contains a single column with the tweet IDs gathered for the study, which allows for the original tweets to be gathered using the Twitter API.
Reference
[1] Nielsen FÅ. A new ANEW: Evaluation of a word list for sentiment analysis in microblogs. arXiv preprint arXiv:11032903. 2011.
[2] Blondel VD, Guillaume JL, Lambiotte R, Lefebvre E. Fast unfolding of communities in large networks. Journal of statistical mechanics: theory and experiment. 2008;2008(10):P10008
Owner
- Login: caroline-pena
- Kind: user
- Repositories: 1
- Profile: https://github.com/caroline-pena
Citation (CITATION.cff)
cff-version: 1.1.1
message: "If you use this data, please cite it as below."
authors:
- family-names: "Pena"
given-names: "Caroline"
orcid: "https://orcid.org/0009-0007-2329-8638"
- family-names: "MacCarron"
given-names: "Padraig"
orcid: "https://orcid.org/0000-0002-5163-9264"
- family-names: "J.P. O'Sullivan"
given-names: "David"
orcid: "https://orcid.org/0000-0002-4754-3614"
title: "Twitter data on the Irish Abortion Referendum of 2018"
version: 2.0.1
url: "https://github.com/caroline-pena/Repeal_the_8th_data"
date-released: 2024-08-02
GitHub Events
Total
- Push event: 1
Last Year
- Push event: 1