ecs235a-cis-cracking-the-code-advanced-intrusion-detection-frameworks-for-scada-security

https://github.com/tanvimehta11/ecs235a-cis-cracking-the-code-advanced-intrusion-detection-frameworks-for-scada-security

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
✓
DOI references
Found 2 DOI reference(s) in README
✓
Academic publication links
Links to: arxiv.org, springer.com
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (10.7%) to scientific vocabulary

Last synced: 9 months ago · JSON representation

Repository

Basic Info

Host: GitHub
Owner: tanvimehta11
Language: Jupyter Notebook
Default Branch: main
Size: 2.67 MB

Statistics

Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 0

Created over 1 year ago · Last pushed over 1 year ago

Metadata Files

Readme Citation

Cracking the Code: Investigating Advanced Intrusion Detection Frameworks for SCADA Security

Overview

This repository contains code and analysis for implementing advanced intrusion detection techniques in SCADA (Supervisory Control and Data Acquisition) systems. SCADA systems are critical for managing industrial operations, but their increasing connectivity to the internet exposes them to cyber threats such as buffer overflows, SQL injection, and cross-site scripting. This project investigates how advanced machine learning and statistical methods can identify and mitigate unauthorized access or anomalies in SCADA systems in real-time.

The work focuses on the Gas Pipeline Dataset, utilizing state-of-the-art techniques to improve the resilience and security of SCADA environments. The dataset comprises 274,628 instances across multiple classes and covers 7 types of cyberattacks, making it an ideal choice for evaluating intrusion detection strategies.

Project Objectives

Implement Advanced Detection Techniques: Develop and compare machine learning models, such as Random Forest and Convolutional Neural Networks, for detecting intrusions in SCADA systems.
Enhance SCADA Security: Analyze key vulnerabilities, particularly in the Modbus protocol, and evaluate strategies to enhance system resilience.
Performance Benchmarking: Utilize metrics such as accuracy, precision, recall, and F1 score to identify optimal intrusion detection approaches.

Dataset

Source: Gas Pipeline Dataset
Features: 17 columns, including command payload features, network characteristics, and response payloads.
Labels: Binary, categorized, and specific results for various types of cyberattacks.
Notable Attributes: Timestamp, source/destination IP, protocol, packet size, and 11 command payload features.

Methodology

Data Preprocessing:
- Cleaning and handling missing values.
- Normalizing data for consistency.
- Splitting the dataset into training and testing sets.
Model Development:
- Baseline Models: Random Forest and Decision Trees KNN, Random Forest, Naive Bayes, MLP for initial analysis.
- Advanced Models: Stacked neural networks using Convolutional Neural Networks (CNNs) with ReLU activation and batch normalization. LSTM and GRU networks for sequence modeling.
Evaluation:
- Metrics: Accuracy, precision, recall, F1 score, ROC-AUC and confusion matrices.
- Comparative analysis of traditional and deep learning models.

Code Files

1. `Baseline_Model_Comparison.ipynb`

Implements baseline models such as Random Forest and Decision Trees.
Provides exploratory data analysis and pre-processing steps.

2. `Stacked_NN.ipynb`

Implements a stacked neural network with advanced configurations like batch normalization.
Evaluates performance on the test dataset and provides insights on classification accuracy.

3. `ICS-IDS.ipynb`

The code cleans, standardizes, and balances the dataset using ENN for robust model training.
Trains and evaluates ML (KNN, Random Forest) and DL (MLP, LSTM, GRU) models with performance visualizations.

Results

Baseline Models: Achieved ~85% accuracy with Random Forest and Decision Trees.
Advanced Models: Stacked Neural Networks improved detection rates, achieving up to 93% accuracy.
ICS-IDS: Combining multiple techniques like feature selection and deep learning models yielded the most robust results. GRU achieved 88.6%, LSTM achieved 87.6%, Random Forest achieved approximately 97.6%, and KNN was 97%.

Technologies and Tools

Languages: Python
Libraries: Scikit-learn, TensorFlow, Pandas, NumPy
Tools: Jupyter Notebook, SciPy, Matplotlib
Dataset Format: ARFF and CSV

Future Work

Extend analysis to include real-time deployment of intrusion detection systems.
Evaluate scalability for larger SCADA systems with live data streams.
Explore additional cybersecurity frameworks like hybrid anomaly detection methods.

References

For further details, please refer to the project proposal and accompanying documentation.

Owner

Name: Tanvi Mehta
Login: tanvimehta11
Kind: user
Location: Pune

Repositories: 1
Profile: https://github.com/tanvimehta11

GitHub Events

Total

Push event: 4

Last Year

Push event: 4

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science

ecs235a-cis-cracking-the-code-advanced-intrusion-detection-frameworks-for-scada-security

Science Score: 49.0%

Repository

Basic Info

Statistics

Metadata Files

README.md

Cracking the Code: Investigating Advanced Intrusion Detection Frameworks for SCADA Security

Overview

Project Objectives

Dataset

Methodology

Code Files

1. `Baseline_Model_Comparison.ipynb`

2. `Stacked_NN.ipynb`

3. `ICS-IDS.ipynb`

Results

Technologies and Tools

Future Work

References

Owner

GitHub Events

Total

Last Year

ecs235a-cis-cracking-the-code-advanced-intrusion-detection-frameworks-for-scada-security

Science Score: 49.0%

Repository

Basic Info

Statistics

Metadata Files

README.md

Cracking the Code: Investigating Advanced Intrusion Detection Frameworks for SCADA Security

Overview

Project Objectives

Dataset

Methodology

Code Files

1. Baseline_Model_Comparison.ipynb

2. Stacked_NN.ipynb

3. ICS-IDS.ipynb

Results

Technologies and Tools

Future Work

References

Owner

GitHub Events

Total

Last Year

1. `Baseline_Model_Comparison.ipynb`

2. `Stacked_NN.ipynb`

3. `ICS-IDS.ipynb`