a-multi-modal-transformer-architecture-combining-sentiment-dynamics-temporal-market-data
Our approach uniquely fuses sentiment dynamics from social media and news sources with temporal market data and macroeconomic indicators to construct dynamic graph representations of interfirm relationships. Further, we employ state-of-the-art GNNs, such as temporal graph convolutions, that adapt to the changing market and significantly enhance it.
Science Score: 49.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 1 DOI reference(s) in README -
✓Academic publication links
Links to: ieee.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (11.1%) to scientific vocabulary
Keywords
Repository
Our approach uniquely fuses sentiment dynamics from social media and news sources with temporal market data and macroeconomic indicators to construct dynamic graph representations of interfirm relationships. Further, we employ state-of-the-art GNNs, such as temporal graph convolutions, that adapt to the changing market and significantly enhance it.
Basic Info
- Host: GitHub
- Owner: abhishekjoshi007
- License: mit
- Language: Python
- Default Branch: main
- Homepage: https://ieeexplore.ieee.org/document/10825219
- Size: 10.7 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Topics
Metadata Files
README.md
Multi-Modal Stock Return Prediction using Transformers and Graph Neural Networks
This repository contains the official implementation of the paper:
"A Multi-Modal Transformer Architecture Combining Sentiment Dynamics, Temporal Market Data, and Macroeconomic Indicators for Sturdy Stock Return Forecasting"
Presented at IEEE International Conference on Big Data (BigData) 2024
Link- https://ieeexplore.ieee.org/document/10825219
Overview
This work proposes a robust, multi-modal forecasting framework that predicts short-term stock returns by integrating:
- Historical Market Data (OHLCV)
- Stock Descriptions
- Shareholder Information
- Sentiment Scores (from news + social media)
- Dynamic Inter-stock Relations via Graphs
Our architecture uses Graph Neural Networks (GraphSAGE, Node2Vec), temporal encoders (LSTM, TGN), and a multi-modal Transformer fusion mechanism to deliver state-of-the-art return prediction accuracy and Sharpe ratio performance.
Architecture Overview

Modules:
- Input Streams:
- Historical Sequences
- Description Documents
- Shareholder Records
- Sentiment Time Series
- Feature Encoders:
- Sequence Encoder for price data
- Text encoders for stock description
- Graph Construction for stock relations
- Graph Embedding:
- Node2Vec / GraphSAGE
- Temporal Modeling:
- LSTM / TGN / GAT
- Output:
- Next-day return prediction
- Stock ranking based on expected returns
Project Structure
bash
A-Multi-Modal-Transformer-Architecture-Combining-Sentiment-Dynamics-Temporal-Market-Data/
Base-Line Models/ # Baseline models (Random Forest, Linear Regression)
Data collection Scripts/ # Scripts to fetch, clean, and align data
Graph Creation/ # Dynamic graph construction
Main Models & Node Embedding/ # Transformer, LSTM, TGN, Node2Vec, GraphSAGE
Recommendation Scripts/ # Stock ranking logic
csv/ # Preprocessed CSV datasets
utils/ # Figures, visuals, supporting artifacts
dataset.png
metrics_1.png
metrics_2.png
Model_Architecture.png
LICENSE
Presentation.pptx
README.md
Dataset
We curated a 6-month dataset of 716 technology-sector stocks with:
- Daily OHLCV values
- Sector and industry descriptions
- Top institutional holders
- Sentiment from Twitter & financial news (BERT-based)
- Generated dynamic inter-stock graphs based on industry & common holders

Problem Statement

Evaluation Metrics
| Metric | Description | |-------------------------|--------------------------------------------------| | Accuracy | Correct movement prediction | | F1 Score | Balance of precision and recall | | Information Coefficient (IC) | Pearson correlation with real returns | | Rank IC | Spearman correlation on ranking of returns | | Sharpe Ratio | Risk-adjusted return | | Long-Short Avg Return | Return from top vs bottom stock strategy |
Results Snapshot

Setup & Run
Requirements
- Python 3.8+
- PyTorch, PyTorch Geometric
- Transformers (HuggingFace)
- scikit-learn, pandas, numpy
Installation
bash
git clone https://github.com/abhishekjoshi007/A-Multi-Modal-Transformer-Architecture-Combining-Sentiment-Dynamics-Temporal-Market-Data
cd A-Multi-Modal-Transformer-Architecture-Combining-Sentiment-Dynamics-Temporal-Market-Data
pip install -r requirements.txt
`
Training
bash
python train.py --model tgn --graph graphsage --epochs 100
Arguments:
--model:lstm,gat,tgn,d-gcn--graph:node2vec,graphsage
Citation
If you use this work, please cite:
bibtex
@INPROCEEDINGS{10825219,
author={Joshi, Abhishek and Koda, Jahnavi Krishna and Hadimlioglu, Alihan},
booktitle={2024 IEEE International Conference on Big Data (BigData)},
title={A Multi-Modal Transformer Architecture Combining Sentiment Dynamics, Temporal Market Data, and Macroeconomic Indicators for Sturdy Stock Return Forecasting},
year={2024},
pages={4896-4902},
doi={10.1109/BigData62323.2024.10825219}
}
Contact
For questions, collaborations, or feedback:
Acknowledgments
Special thanks to:
- Texas A&M UniversityCorpus Christi
- IEEE Big Data 2024 Committee
- Open-source contributors to PyTorch Geometric and HuggingFace
Owner
- Name: Abhishek Joshi
- Login: abhishekjoshi007
- Kind: user
- Location: Texas
- Company: Texas A&M Corpus Christi CS Grad
- Website: abhishek.07joshi@gmail.com
- Twitter: abhishek_7joshi
- Repositories: 2
- Profile: https://github.com/abhishekjoshi007
Software Developer | Machine Learning Enthusiast.
GitHub Events
Total
- Push event: 28
- Fork event: 1
Last Year
- Push event: 28
- Fork event: 1