https://github.com/cad-polito-it/fod

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (7.6%) to scientific vocabulary

Last synced: 10 months ago · JSON representation

Repository

Basic Info

Host: GitHub
Owner: cad-polito-it
Language: Jupyter Notebook
Default Branch: master
Size: 794 KB

Statistics

Stars: 0
Watchers: 3
Forks: 0
Open Issues: 0
Releases: 0

Created about 1 year ago · Last pushed about 1 year ago

Metadata Files

Readme

Faulty Output Dataset

Faulty Output Dataset (FOD) is a custom dataset, developed to evaluate fault detection metrics. The dataset is created by injecting permanent faults in the synaptic weights of the Fast-SCNN network and processing images taken from the Cityscapes validation partition, made of 500 labelled images. Faults are generated through Statistical Fault Injection (error = 2%, confidence = 99%), selecting 8,439 faults at specific weights in the Fast-SCNN model, flipping the most significant bit in a randomly selected synaptic weight, thus significantly altering the parameter values due to the FP32 representation. From the 8439 × 500 faulty outputs, a total of random 68,001 image-fault pairs were selected.

Dataset Examples

| Frame | Injection | mIoU | PA | Label | Clean Output | Faulty Output | |:-----:|:---------:|:----:|:--:|:-----:|:------------:|:-------------:| | 479 | 595 | 0.017 | 0.19 | 1 | | | | 236 | 191 | 1.0 | 1.0 | 0 | | | | 462 | 4273 | 0.98 | 1.0 | 0 | | | | 295 | 9729 | 0.0048 | 0.082 | 1 | | | | 189 | 3680 | 0.9 | 1.0 | 0 | | | | 102 | 6016 | 1.0 | 1.0 | 0 | | | | 398 | 7110 | 0.038 | 0.49 | 1 | | | | 331 | 7844 | 0.019 | 0.11 | 1 | | | | 10 | 1184 | 0.0062 | 0.035 | 1 | | | | 191 | 12040 | 0.99 | 1.0 | 0 | | |

The FOD dataset features a pronounced class prevalence, with 88.10% of the entries labeled as critical and 11.90% as non-critical. Additionally, a significant portion of the dataset 43.07% consists of masks where all pixels belong to a single class, most commonly 'road'. The remaining 56.93% comprises more complex masks with varying class distributions. The FOD dataset includes information on fault parameters, such as the frame index and the fault details (including the affected layer, tensor index, and bit position), as well as performance metrics such as mIoU and PA. However, these metrics are not used for labeling the faults. Instead, the labels are manually assigned based on whether the output mask is classified as critical (C) or non-critical (NC). To determine whether a mask is considered critical, we defined three heuristic rules. If at least one of these points is satisfied, the mask is labeled as critical. The three rules are as follows:

Uniform Class Labeling: The mask consists of pixels that are all assigned to the same class, ensuring that every pixel in the mask is consistently labeled.
Large Spatially Continuous Misclassified Regions: The mask contains large, spatially continuous areas of misclassified pixels, which disrupt the visual coherence of the segmented object, resulting in a distortion of its perceived shape.
Incorrect Classification of Specific Categories: The mask includes contiguous regions of pixels from categories that involve the presence of a human being, spanning from class 11 (person) to class 18 (bicycle), that are incorrectly classified.

This custom FOD dataset serves as a foundation for evaluating the effectiveness of a new fault detection approach (visit Segmentation Metrics).

Dataset Testing

The dataset is available on Google Drive. By adding the shortcut to your Drive it is possible to test it on Google Colab with the link at the top of this README. You can also test it locally with dataset_analysis.ipynb notebook, which explains how to properly use the dataset.

Main Contributors

Lorenzo Fezza: lorenzo.fezza@polito.it
Vittorio Turco: vittorio.turco@polito.it
Ernesto Sanchez: ernesto.sanchez@polito.it
Annachiara Ruospo: annachiara.ruospo@polito.it
Matteo Sonza Reorda: matteo.sonzareorda@polito.it

Owner

Name: CAD & Reliability Group
Login: cad-polito-it
Kind: organization
Location: Italy

Website: https://cad.polito.it
Repositories: 1
Profile: https://github.com/cad-polito-it

Works produced by the CAD & Reliability group of the Department of Control and Computer Engineering (DAUIN) of Politecnico di Torino

GitHub Events

Total

Member event: 1
Push event: 2
Create event: 2

Last Year

Member event: 1
Push event: 2
Create event: 2

Dependencies

requirements.txt pypi

Pillow *
ipython *
ipywidgets *
matplotlib *
numpy *
pandas *
torch *

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science