netra

nlp model to classify cybersecurity report descriptions

https://github.com/chiragagg5k/netra

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.5%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

nlp model to classify cybersecurity report descriptions

Basic Info
  • Host: GitHub
  • Owner: ChiragAgg5k
  • License: mit
  • Language: Jupyter Notebook
  • Default Branch: master
  • Size: 44.1 MB
Statistics
  • Stars: 1
  • Watchers: 1
  • Forks: 1
  • Open Issues: 0
  • Releases: 0
Created over 1 year ago · Last pushed 8 months ago
Metadata Files
Readme License Citation

README.md

Netra - AI-Powered Cybercrime Classification System

Python Version License Framework

Overview

Netra is an advanced cybercrime classification system that uses Natural Language Processing (NLP) to automatically categorize cybercrime complaints. Built for the IndiaAI CyberGuard Hackathon, it employs dual Random Forest classifiers to simultaneously predict both main categories and subcategories of cybercrime incidents.

Key Features

  • Dual-classification system with 89.5% accuracy
  • Advanced text preprocessing pipeline
  • Production-ready with comprehensive error handling
  • Automated model retraining capabilities
  • Privacy-preserving feature extraction

Quick Start

Prerequisites

Installation

```bash

Clone the repository

git clone https://github.com/ChiragAgg5k/netra.git cd netra

Create and activate virtual environment

uv sync ```

What does the repository contain?

  1. The src/ directory contains various ipynb notebooks for the project, including a SVM, Random Forest and Multi-Vote architecture for training and testing the pipeline.

  2. data/ folder contains test.csv and train.csv files for training and testing the pipeline. These files were obtained from the IndiaAI CyberGuard Hackathon.

  3. assets/ folder contains the graphs generated in the notebooks.

Contact

For any queries or support: - Email: chiragaggarwal5k@gmail.com - GitHub Issues: Create an issue

Owner

  • Name: Chirag Aggarwal
  • Login: ChiragAgg5k
  • Kind: user
  • Location: Noida , Uttar Pradesh , India
  • Company: Bennett University

CSE Undergrad | Student at Bennett University

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: Aggarwal
    given-names: Chirag
    orcid: https://orcid.org/0009-0000-7455-3941
title: "Netra - AI-Powered Cybercrime Classification System"
version: 0.1.0
date-released: 2025-07-12

GitHub Events

Total
  • Watch event: 1
  • Push event: 19
  • Fork event: 1
  • Create event: 2
Last Year
  • Watch event: 1
  • Push event: 19
  • Fork event: 1
  • Create event: 2

Issues and Pull Requests

Last synced: over 1 year ago

All Time
  • Total issues: 0
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 0
  • Total pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels

Dependencies

pyproject.toml pypi
  • nltk ^3.9.1
  • numpy ^2.1.2
  • pandas ^2.2.3
  • python ^3.11
  • scikit-learn ^1.5.2
  • seaborn ^0.13.2
uv.lock pypi
  • appnope 0.1.4
  • asttokens 3.0.0
  • cffi 1.17.1
  • colorama 0.4.6
  • comm 0.2.2
  • contourpy 1.3.2
  • cycler 0.12.1
  • debugpy 1.8.14
  • decorator 5.2.1
  • executing 2.2.0
  • fonttools 4.58.5
  • ipykernel 6.29.5
  • ipython 9.4.0
  • ipython-pygments-lexers 1.1.1
  • jedi 0.19.2
  • joblib 1.5.1
  • jupyter-client 8.6.3
  • jupyter-core 5.8.1
  • kiwisolver 1.4.8
  • matplotlib 3.10.3
  • matplotlib-inline 0.1.7
  • nest-asyncio 1.6.0
  • netra 0.1.0
  • numpy 2.3.1
  • packaging 25.0
  • pandas 2.3.1
  • parso 0.8.4
  • pexpect 4.9.0
  • pillow 11.3.0
  • platformdirs 4.3.8
  • prompt-toolkit 3.0.51
  • psutil 7.0.0
  • ptyprocess 0.7.0
  • pure-eval 0.2.3
  • pycparser 2.22
  • pygments 2.19.2
  • pyparsing 3.2.3
  • python-dateutil 2.9.0.post0
  • pytz 2025.2
  • pywin32 310
  • pyzmq 27.0.0
  • scikit-learn 1.7.0
  • scipy 1.16.0
  • seaborn 0.13.2
  • six 1.17.0
  • stack-data 0.6.3
  • threadpoolctl 3.6.0
  • tornado 6.5.1
  • traitlets 5.14.3
  • typing-extensions 4.14.1
  • tzdata 2025.2
  • wcwidth 0.2.13
  • wordcloud 1.9.4