gnns_powergraph

https://github.com/maaguado/gnns_powergraph


Repository

Basic Info

Host: GitHub
Owner: maaguado
Language: Jupyter Notebook
Default Branch: main
Size: 489 MB

Statistics

Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 0

Created about 2 years ago · Last pushed over 1 year ago


Master Thesis - Forecasting and Classification of Incidents in Electrical Networks Using Temporal Neural Networks on Graphs

Overview

This repository contains the code and documentation for my Master Thesis project, which focuses on conducting experiments with different temporal GNN architectures applied to two problems: classification and forecasting of incidents in electrical networks.

The data used in this project is a dataset obtained by simulating different kinds of incidents in a distribution and transmission power grid, originally presented in the paper "A Multi-scale Time-series Dataset with Benchmark for Machine Learning in Decarbonized Energy Grids" (https://arxiv.org/abs/2110.06324).

Summary and Background

The significant climate changes observed and anticipated for the 21st century, including global warming, are a consequence of more than a century of net greenhouse gas emissions and reflect profound changes that have manifested over the past 65 years.

Energy-related greenhouse gas emissions are the largest contributor to climate warming in recent years. For this reason, the role of advanced technologies becomes crucial in addressing and mitigating incidents in electrical networks, as these networks are utilized in most sectors and have the potential to generate a positive impact on them.

In this study, experiments have been conducted with a power grid transmission system through various network disturbance simulations, specifically focusing on five types of issues: generator trip, node trip, node fault, branch trip, and branch fault. Each simulation has been modeled as a sequence of graphs, and the experiments have been conducted using neural network technologies specifically designed for dynamic graphs.

The objective has been to solve two main problems: first, the identification of incident types using historical network data; second, the development of a predictive system that, for a specific type of incident, predicts the future state of the network, aiming to anticipate conditions of instability in the electrical grid to facilitate effective preventive actions.

The experimentation included graph neural network architectures based on recurrent networks (AGCRN, DyGrEncoder, MPNN-LSTM, EvolveGCN, and DCRNN), and architectures based on attention mechanisms (ASTGCN, MSTGCN, MTGNN, and STConv). For each of these, an adaptation process was necessary to incorporate the structural information of the graph over time. Additionally, in both the classification and regression problems, the results obtained with a simple LSTM architecture were included as a reference, and they were compared using appropriate metrics in each case.

Project Description

This project is composed of the following stages:

Data Preparation: software developed to generate the power grid dataset for the experiments from the files provided with the dataset paper.
Model Development: development of different GNN-based models, built on the implementations in PyTorch Geometric Temporal, with a focus on adapting each model to the modular software developed for the project and to the specifics of the dataset, such as the inclusion of grid interactions.
Generalized Training Class: modular software that trains the implemented models. The heart of the system is a training class that manages the entire lifecycle of the models, from data loading, training, and validation to evaluation and result export. This class has been designed to be highly flexible and extensible, allowing new models to be incorporated with minimal modification to the existing code.
Experiments: we have tested all the models on the dataset, fine-tuned their hyperparameters, and compared the results with a simple LSTM model. The results are presented in the study and compared using appropriate metrics for each problem.

Repository Structure and Description of main classes

Data Preparation

| File Name | Description | Stage and Type |
| --- | --- | --- |
| Data Preprocessing and EDA Notebook | Initial experiment with dataloader class and EDA | |
| DataLoader Class | Includes preliminary experiments for different types of time series and missing data distribution | |

PowerGridDatasetLoader Class

The PowerGridDatasetLoader class handles the loading and preprocessing of power grid datasets for use in machine learning tasks, specifically regression and classification, ensuring that the data is correctly formatted and preprocessed before it is fed into the model. Below is an outline of its attributes:

naturalfolder: A string representing the path to the directory containing the dataset files.
problem: A string that indicates the type of problem being addressed, either "regression" or "classification".
voltages: A dictionary that stores voltage data for each node in the power grid.
buses: A list of bus numbers representing different nodes in the power grid.
types: A list that holds the type of each incident.
edge_attr: A list of attributes associated with the edges in the graph representation of the power grid.
edge_index: A list of indices that represent the connections (edges) between nodes (buses) in the grid.
processed: A boolean flag indicating whether the data has been processed.
transformation_dict: A dictionary used for transforming node identifiers.
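
To make the attribute list above more concrete, here is a minimal sketch of a container with the same fields. It is not the repository's implementation: the class name `LoaderSketch`, the `process` method, the `branches` argument, and the `"data/"` path are all hypothetical, and the remapping of bus numbers to contiguous indices is an assumption about what `transformation_dict` is used for.

```python
from dataclasses import dataclass, field


@dataclass
class LoaderSketch:
    """Hypothetical container mirroring the attributes listed above."""

    naturalfolder: str
    problem: str
    voltages: dict = field(default_factory=dict)  # bus number -> voltage series
    buses: list = field(default_factory=list)
    types: list = field(default_factory=list)
    edge_attr: list = field(default_factory=list)
    edge_index: list = field(default_factory=list)
    processed: bool = False
    transformation_dict: dict = field(default_factory=dict)

    def __post_init__(self):
        # The README names exactly two problem types.
        if self.problem not in ("regression", "classification"):
            raise ValueError(f"unknown problem type: {self.problem!r}")

    def process(self, branches):
        """Assumed step: remap bus numbers to 0..N-1 and build the edge lists.

        `branches` is a hypothetical list of (from_bus, to_bus, attribute) tuples.
        """
        self.transformation_dict = {
            bus: index for index, bus in enumerate(sorted(self.buses))
        }
        sources, targets, attrs = [], [], []
        for from_bus, to_bus, attr in branches:
            sources.append(self.transformation_dict[from_bus])
            targets.append(self.transformation_dict[to_bus])
            attrs.append(attr)
        # Two parallel lists: row 0 holds source nodes, row 1 holds target nodes.
        self.edge_index = [sources, targets]
        self.edge_attr = attrs
        self.processed = True
        return self


loader = LoaderSketch(naturalfolder="data/", problem="classification",
                      buses=[101, 205, 150])
loader.process([(101, 150, 0.02), (150, 205, 0.05)])
```

Storing the edges as two parallel lists of source and target indices matches the layout that PyTorch Geometric expects for `edge_index`, which is why the bus numbers are first mapped to contiguous indices in this sketch.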

Model Development

| File Name | Description | Stage and Type |
| --- | --- | --- |
| Models Implementations | Includes the implementation in PyTorch for each one of the models included in the study | |

In particular, the models included are:

Recurrent Graph Convolutions

DCRNN from Li et al.: Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting (ICLR 2018)
DyGrEncoder from Taheri et al.: Learning to Represent the Evolution of Dynamic Graphs with Recurrent Models
EvolveGCNO from Pareja et al.: EvolveGCN: Evolving Graph Convolutional Networks for Dynamic Graphs
AGCRN from Bai et al.: Adaptive Graph Convolutional Recurrent Network for Traffic Forecasting (NeurIPS 2020)
MPNN LSTM from Panagopoulos et al.: Transfer Graph Neural Networks for Pandemic Forecasting (AAAI 2021)

Attention Aggregated Temporal Graph Convolutions

STGCN from Yu et al.: Spatio-Temporal Graph Convolutional Networks: A Deep Learning Framework for Traffic Forecasting (IJCAI 2018)
ASTGCN from Guo et al.: Attention Based Spatial-Temporal Graph Convolutional Networks for Traffic Flow Forecasting (AAAI 2019)
MSTGCN from Guo et al.: Attention Based Spatial-Temporal Graph Convolutional Networks for Traffic Flow Forecasting (AAAI 2019)
MTGNN from Wu et al.: Connecting the Dots: Multivariate Time Series Forecasting with Graph Neural Networks (KDD 2020)

Generalized Training Class

| File Name | Description | Stage and Type |
| --- | --- | --- |
| Trainer Class | Includes the implementation of the trainer class, which manages the entire lifecycle of the models, from data loading, training, and validation to evaluation and result export | |

The trainer.py file serves as the core of a flexible and extensible training framework designed to handle the training, validation, and evaluation processes of various machine learning models. At the heart of this framework is the TrainerModel class, a generalized trainer class that provides the foundational structure and methods required for managing the training lifecycle. This includes tasks such as data loading, model initialization, optimization, and performance tracking.

The TrainerModel class is engineered to be highly adaptable, allowing it to serve as a base class from which more specialized trainer classes are derived. Each of these derived classes is tailored to accommodate the unique training requirements of different model architectures, such as LSTM networks, Graph Convolutional Networks (GCN), and other advanced models that integrate temporal and graph-based learning. By leveraging the TrainerModel as a common framework, the software reduces redundancy and ensures consistency across the training processes of diverse models. This approach not only streamlines the integration of new models but also enhances the maintainability of the training pipeline, allowing each model-specific trainer class to focus on the intricacies of its respective architecture while relying on the generalized TrainerModel class for the overall training logic.
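
The base-class pattern described above can be sketched in plain Python as follows. This is an illustration under stated assumptions, not the code in trainer.py: the names `BaseTrainerSketch`, `ScalarRegressionTrainer`, `train_step`, `eval_step`, and `fit` are hypothetical, and a toy one-parameter model stands in for the LSTM and GNN models so that the example runs without PyTorch.

```python
from abc import ABC, abstractmethod


class BaseTrainerSketch(ABC):
    """Hypothetical base trainer: owns the epoch loop and the metric history."""

    def __init__(self, train_data, val_data, epochs=5):
        self.train_data = train_data
        self.val_data = val_data
        self.epochs = epochs
        self.history = []  # one (train_loss, val_loss) pair per epoch

    @abstractmethod
    def train_step(self, batch):
        """Update the model on one batch and return its loss."""

    @abstractmethod
    def eval_step(self, batch):
        """Return the loss on one batch without updating the model."""

    def fit(self):
        # Shared lifecycle: train on every batch, then validate, then record.
        for _ in range(self.epochs):
            train_loss = sum(self.train_step(b) for b in self.train_data)
            val_loss = sum(self.eval_step(b) for b in self.val_data)
            self.history.append((train_loss / len(self.train_data),
                                 val_loss / len(self.val_data)))
        return self.history


class ScalarRegressionTrainer(BaseTrainerSketch):
    """Toy subclass: fits y = w * x by gradient descent on the squared error."""

    def __init__(self, *args, lr=0.05, **kwargs):
        super().__init__(*args, **kwargs)
        self.w = 0.0
        self.lr = lr

    def eval_step(self, batch):
        x, y = batch
        return (self.w * x - y) ** 2

    def train_step(self, batch):
        x, y = batch
        loss = self.eval_step(batch)
        # Gradient of (w * x - y) ** 2 with respect to w.
        self.w -= self.lr * 2 * (self.w * x - y) * x
        return loss


data = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]
trainer = ScalarRegressionTrainer(data, data, epochs=20)
history = trainer.fit()
```

In this arrangement a new model only needs to supply its own `train_step` and `eval_step`; the loop and the bookkeeping stay in the base class, which is the kind of reuse the paragraph above describes.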

Experiments

As mentioned earlier, we have conducted experiments on both the regression and classification problems, using the implemented models. For the regression problem, however, we have separated the simulations by incident type (gen trip, bus trip, bus fault, branch trip and branch fault), since the variability of the data is very high and the nature of the incidents differs considerably.
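
As a rough illustration of the per-incident split used for regression, the sketch below groups simulations by incident type so that each group can be trained on separately. The function name `split_by_incident` and the `(incident_type, simulation)` pair format are hypothetical, and the simulation identifiers are placeholders rather than real data.

```python
from collections import defaultdict

# The five incident types named in the text.
INCIDENT_TYPES = ("gen trip", "bus trip", "bus fault", "branch trip", "branch fault")


def split_by_incident(simulations):
    """Group (incident_type, simulation) pairs into one list per incident type."""
    groups = defaultdict(list)
    for incident_type, simulation in simulations:
        if incident_type not in INCIDENT_TYPES:
            raise ValueError(f"unknown incident type: {incident_type!r}")
        groups[incident_type].append(simulation)
    return dict(groups)


# Placeholder identifiers stand in for the graph sequences of each simulation.
simulations = [
    ("gen trip", "sim_a"),
    ("bus fault", "sim_b"),
    ("gen trip", "sim_c"),
]
groups = split_by_incident(simulations)
```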

Also, due to computational limitations, for some of the models we have only been able to tune the hyperparameters over a subset of the parameter space. This information is included in the table below.
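
One common way to explore only part of a parameter space is to draw a fixed number of configurations at random from the full grid. The sketch below shows that idea; it is an assumption for illustration, since the text does not state how the subset was chosen, and the function name `sample_configurations` and the grid values are hypothetical.

```python
import itertools
import random


def sample_configurations(grid, max_trials, seed=0):
    """Return at most `max_trials` configurations drawn from the full grid."""
    keys = sorted(grid)
    all_configs = [
        dict(zip(keys, values))
        for values in itertools.product(*(grid[key] for key in keys))
    ]
    if len(all_configs) <= max_trials:
        return all_configs
    # Sampling without replacement, seeded so that runs are repeatable.
    return random.Random(seed).sample(all_configs, max_trials)


# Hypothetical grid: 3 * 2 * 2 = 12 combinations, of which only 4 are tried.
grid = {
    "hidden_size": [16, 32, 64],
    "learning_rate": [1e-2, 1e-3],
    "dropout": [0.0, 0.2],
}
configs = sample_configurations(grid, max_trials=4)
```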

Regression

| File Name | Type of Parameter Tuning | Type |
| --- | --- | --- |
| DCRNN Notebook | | |
| A3TGCN Notebook | | |
| AGCRN Notebook | | |
| MSTGCN Notebook | | |
| ASTGCN Notebook | | |
| DyGrEncoder Notebook | | |
| EvolveGCN Notebook | | |
| MPNN LSTM Notebook | | |
| STConv Notebook | | |
