dataset-hornet-tools

Repository to host the tools used to process and analyze the data for the Stratosphere CTU Hornet datasets.

https://github.com/stratosphereips/dataset-hornet-tools

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (6.7%) to scientific vocabulary

Repository

Basic Info
  • Host: GitHub
  • Owner: stratosphereips
  • License: gpl-2.0
  • Language: Python
  • Default Branch: main
  • Size: 68.4 KB
Statistics
  • Stars: 0
  • Watchers: 2
  • Forks: 0
  • Open Issues: 0
  • Releases: 1
Created over 1 year ago · Last pushed about 1 year ago
Metadata Files
Readme Contributing License Code of conduct Citation Security

README.md

Dataset Hornet Tools

This repository contains a tool suite for working with the Stratosphere Hornet Datasets of geographically distributed honeypots.

Cleaning, Ingestion, and Analysis Tools

The repository contains scripts and tools to process the Hornet data. These are organised by activity:

  • Cleaning: scripts for data cleaning, such as removing specific IPs.
  • Ingestion: tools and scripts that ingest the data into a database for further processing.
  • Metrics: tools and scripts for data analysis tasks, such as gathering statistics.
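
As an illustration of the kind of cleaning step described above, the following is a minimal sketch of removing records that come from specific IPs or networks. It is not the repository's actual script: the function name `drop_excluded_ips`, the `src_ip` field name, and the policy of dropping records with a missing or malformed address are all assumptions made for this example. It uses only the Python standard library so that it stays self-contained.

```python
import ipaddress
from typing import Iterable, Iterator


def drop_excluded_ips(
    records: Iterable[dict],
    excluded: Iterable[str],
    ip_field: str = "src_ip",  # hypothetical field name, adjust to the real schema
) -> Iterator[dict]:
    """Yield only the records whose IP lies outside every excluded address or network."""
    # Accept both single addresses ("192.0.2.7") and CIDR ranges ("10.0.0.0/8").
    networks = [ipaddress.ip_network(item, strict=False) for item in excluded]
    for record in records:
        try:
            address = ipaddress.ip_address(record[ip_field])
        except (KeyError, ValueError):
            # Assumed policy: skip records with a missing or malformed IP.
            continue
        if not any(address in network for network in networks):
            yield record


if __name__ == "__main__":
    sample = [
        {"src_ip": "10.0.0.5", "dst_port": 22},
        {"src_ip": "192.0.2.7", "dst_port": 23},
    ]
    for kept in drop_excluded_ips(sample, ["10.0.0.0/8"]):
        print(kept)
```

Because the function is a generator, it can be chained onto any record iterator, including a streaming JSON parser, without loading the whole dataset into memory.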

Dataset Metrics

To generate the key dataset metrics as they appear in the paper, we recommend running the metrics/duckdb_metrics.py tool with the parameter --metrics:

    python3 metrics/duckdb_metrics.py --db_name ./ctu-hornet-65-niner_v0.1.db --metrics

For more details, see the documentation in the metrics/ folder.

About

This repo was developed at the Stratosphere Laboratory at the Czech Technical University in Prague.

Owner

  • Name: Stratosphere IPS
  • Login: stratosphereips
  • Kind: organization
  • Location: Prague

Cybersecurity Research Laboratory at the Czech Technical University in Prague. Creators of Slips, a free software machine learning-based behavioral IDS/IPS.

Citation (CITATION.cff)

doi: 10.5281/zenodo.1234
cff-version: 1.2.0
title: >-
  Dataset Hornet Tools: a tool suite to work with the Stratosphere Hornet Datasets
message: 'If you use this software, please cite it as specified below.'
type: software
authors:
  - given-names: Veronica
    family-names: Valeros
    email: valerver@fel.cvut.cz
    affiliation: >-
      Stratosphere Laboratory, AIC, FEL, Czech
      Technical University in Prague
    orcid: 'https://orcid.org/0000-0003-2554-3231'
date-released: 2024-07-24
url: "https://github.com/stratosphereips/dataset-hornet-tools"

GitHub Events

Total
  • Create event: 5
  • Release event: 1
  • Issues event: 11
  • Delete event: 1
  • Push event: 12
  • Pull request event: 2
Last Year
  • Create event: 5
  • Release event: 1
  • Issues event: 11
  • Delete event: 1
  • Push event: 12
  • Pull request event: 2

Dependencies

.github/workflows/autotag.yml actions
  • actions/checkout v2 composite
  • anothrNick/github-tag-action 1.36.0 composite
requirements.txt pypi
  • duckdb *
  • ijson *