a-multi-modal-transformer-architecture-combining-sentiment-dynamics-temporal-market-data

Our approach uniquely fuses sentiment dynamics from social media and news sources with temporal market data and macroeconomic indicators to construct dynamic graph representations of interfirm relationships. Further, we employ state-of-the-art GNNs, such as temporal graph convolutions, that adapt to the changing market and significantly enhance it.

https://github.com/abhishekjoshi007/a-multi-modal-transformer-architecture-combining-sentiment-dynamics-temporal-market-data

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 1 DOI reference(s) in README
  • Academic publication links
    Links to: ieee.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.1%) to scientific vocabulary

Keywords

graphneuralnetwork ieee multimodal-deep-learning stock-data stock-prediction stock-price-prediction
Last synced: 6 months ago · JSON representation

Repository

Our approach uniquely fuses sentiment dynamics from social media and news sources with temporal market data and macroeconomic indicators to construct dynamic graph representations of interfirm relationships. Further, we employ state-of-the-art GNNs, such as temporal graph convolutions, that adapt to the changing market and significantly enhance it.

Basic Info
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Topics
graphneuralnetwork ieee multimodal-deep-learning stock-data stock-prediction stock-price-prediction
Created over 1 year ago · Last pushed 9 months ago
Metadata Files
Readme License Citation

README.md

Multi-Modal Stock Return Prediction using Transformers and Graph Neural Networks

This repository contains the official implementation of the paper:

"A Multi-Modal Transformer Architecture Combining Sentiment Dynamics, Temporal Market Data, and Macroeconomic Indicators for Sturdy Stock Return Forecasting"
Presented at IEEE International Conference on Big Data (BigData) 2024

Link- https://ieeexplore.ieee.org/document/10825219

Overview

This work proposes a robust, multi-modal forecasting framework that predicts short-term stock returns by integrating:

  • Historical Market Data (OHLCV)
  • Stock Descriptions
  • Shareholder Information
  • Sentiment Scores (from news + social media)
  • Dynamic Inter-stock Relations via Graphs

Our architecture uses Graph Neural Networks (GraphSAGE, Node2Vec), temporal encoders (LSTM, TGN), and a multi-modal Transformer fusion mechanism to deliver state-of-the-art return prediction accuracy and Sharpe ratio performance.

Architecture Overview

Model Pipeline

Modules:

  • Input Streams:
    • Historical Sequences
    • Description Documents
    • Shareholder Records
    • Sentiment Time Series
  • Feature Encoders:
    • Sequence Encoder for price data
    • Text encoders for stock description
    • Graph Construction for stock relations
  • Graph Embedding:
    • Node2Vec / GraphSAGE
  • Temporal Modeling:
    • LSTM / TGN / GAT
  • Output:
    • Next-day return prediction
    • Stock ranking based on expected returns

Project Structure

bash A-Multi-Modal-Transformer-Architecture-Combining-Sentiment-Dynamics-Temporal-Market-Data/ Base-Line Models/ # Baseline models (Random Forest, Linear Regression) Data collection Scripts/ # Scripts to fetch, clean, and align data Graph Creation/ # Dynamic graph construction Main Models & Node Embedding/ # Transformer, LSTM, TGN, Node2Vec, GraphSAGE Recommendation Scripts/ # Stock ranking logic csv/ # Preprocessed CSV datasets utils/ # Figures, visuals, supporting artifacts dataset.png metrics_1.png metrics_2.png Model_Architecture.png LICENSE Presentation.pptx README.md

Dataset

We curated a 6-month dataset of 716 technology-sector stocks with:

  • Daily OHLCV values
  • Sector and industry descriptions
  • Top institutional holders
  • Sentiment from Twitter & financial news (BERT-based)
  • Generated dynamic inter-stock graphs based on industry & common holders

Datset

Problem Statement

Problem Formulation

Evaluation Metrics

| Metric | Description | |-------------------------|--------------------------------------------------| | Accuracy | Correct movement prediction | | F1 Score | Balance of precision and recall | | Information Coefficient (IC) | Pearson correlation with real returns | | Rank IC | Spearman correlation on ranking of returns | | Sharpe Ratio | Risk-adjusted return | | Long-Short Avg Return | Return from top vs bottom stock strategy |

Results Snapshot

Evaluation

Setup & Run

Requirements

  • Python 3.8+
  • PyTorch, PyTorch Geometric
  • Transformers (HuggingFace)
  • scikit-learn, pandas, numpy

Installation

bash git clone https://github.com/abhishekjoshi007/A-Multi-Modal-Transformer-Architecture-Combining-Sentiment-Dynamics-Temporal-Market-Data cd A-Multi-Modal-Transformer-Architecture-Combining-Sentiment-Dynamics-Temporal-Market-Data pip install -r requirements.txt `

Training

bash python train.py --model tgn --graph graphsage --epochs 100

Arguments:

  • --model: lstm, gat, tgn, d-gcn
  • --graph: node2vec, graphsage

Citation

If you use this work, please cite:

bibtex @INPROCEEDINGS{10825219, author={Joshi, Abhishek and Koda, Jahnavi Krishna and Hadimlioglu, Alihan}, booktitle={2024 IEEE International Conference on Big Data (BigData)}, title={A Multi-Modal Transformer Architecture Combining Sentiment Dynamics, Temporal Market Data, and Macroeconomic Indicators for Sturdy Stock Return Forecasting}, year={2024}, pages={4896-4902}, doi={10.1109/BigData62323.2024.10825219} }

Contact

For questions, collaborations, or feedback:

Acknowledgments

Special thanks to:

  • Texas A&M UniversityCorpus Christi
  • IEEE Big Data 2024 Committee
  • Open-source contributors to PyTorch Geometric and HuggingFace

Owner

  • Name: Abhishek Joshi
  • Login: abhishekjoshi007
  • Kind: user
  • Location: Texas
  • Company: Texas A&M Corpus Christi CS Grad

Software Developer | Machine Learning Enthusiast.

GitHub Events

Total
  • Push event: 28
  • Fork event: 1
Last Year
  • Push event: 28
  • Fork event: 1