gaze-fixation-and-object-saliency

This repository is related to estimating the driver's attention to the outside scene view as a point of gaze (PoG) w.r.t the gaze angles extracted from the driver facing view. We also explore analysis of gaze saliecny in form of heatmaps and yarbus plots.

https://github.com/vtti/gaze-fixation-and-object-saliency

Keywords

gaze-estimation gaze-saliency object-detection object-fixation panoptic pog yarbus

Last synced: 11 months ago · JSON representation

Repository

This repository is related to estimating the driver's attention to the outside scene view as a point of gaze (PoG) w.r.t the gaze angles extracted from the driver facing view. We also explore analysis of gaze saliecny in form of heatmaps and yarbus plots.

Basic Info

Host: GitHub
Owner: VTTI
License: mit
Language: Python
Default Branch: master
Homepage:
Size: 46.3 MB

Statistics

Stars: 4
Watchers: 3
Forks: 2
Open Issues: 0
Releases: 0

Topics

gaze-estimation gaze-saliency object-detection object-fixation panoptic pog yarbus

Created over 3 years ago · Last pushed over 3 years ago

Metadata Files

Readme License Citation

README.md

Driver Gaze Fixation and Object Saliency

This repository provides code to estimate Point of Gaze (PoG) on the scene-facing (front video) view w.r.t the gaze angle obtained from the driver-faciing (face video) view. We also provide other functionalities listed below. Please note that the code is tested on the specified models but you could change them according to your needs.

Closest object of fixation from PoG using Panoptic Segmentation (Detectron2)
Closest object of fixation from PoG using Object Detection (SHRP2 trained models)
Fixation Heatmap (Heatmap generated for the current frame by aggregating the PoGs from the last (fps x 3) frames)
Yarbus Plots ## Prerequisite

Download shrp2modelfile into ./models

Download Gaze360modelfile into ./models

Installation

To install the framework the recommened method is by following these steps:

Install Docker (https://www.docker.com/).

Create a new file named "Dockerfile" and copy inside the contents of "docker/Dockerfile".

Run: docker build . -t gazefixation

Run: nvidia-docker run -it --gpus=all --rm -v "pathtolocalfolder":"pathinside_docker" gazefixation bash

Setting up the config file

Below is an examaple of the config file. ```

gaze angle estimation model path, architecture and batch size (we use L2CSNet's specifications)

snapshotpath: ./models/L2CSNetgaze360.pkl arch: ResNet50 batch_size: 1

object detection model's specifications (we have used our inhouse SHRP2 object detection model)

detgpuid: 1 detmodelcheckpoint: ./models/shrp2originalsplit+extradataoutsideobjects.pth detmodelclasses: ./models/classes/classesshrp2extra.txt detmodelconfig: ./models/customconfigs/shrp2+extra/cascadercnnr101fpndconvc3-c51xcoco.py detmodel_threshold: 0.5

dist is the distance in pixels from driver's face to the camera

dist: 480

fpicth and fyaw are the correction of front camera view

fpitch: -0.04 fyaw: 0.15

frontpicth and frontyaw are the angles calculated by estimatefrontangle.py file and are automatically updated in the config file when executed

frontpitch: -0.4002903467336685 frontyaw: 0.03331976381909563

These are just stored to retain previous calculations of frontpicth and frontyaw

oldfrontpitch: -0.4002903467336685 oldfrontyaw: 0.03331976381909563

Segmentation model configurations are stored here. (we use detectron2's panoptic model)

segconfidencethreshold: 0.5 segconfigfile: configs/COCO-PanopticSegmentation/panopticfpnR1013x.yaml segopts: - MODEL.WEIGHTS - detectron2://COCO-PanopticSegmentation/panopticfpnR1013x/139514519/modelfinalcafdb1.pkl segoverlay: 1 ```

Demo Example

This command runs the algorithm on the example video files in the Videos folder. It performs both segmentation and detection and stores the result in the current directory. ./run.sh ./Videos/face_ex.mp4 ./Videos/front_ex.mp4 0 ./config.yaml .

Usage

You could directly use the run.sh command to get the desired output. ./run.sh <face_video_file_path> <front_video_file_path> <gpu_id> <config_file_path> <output_directory> Note: for filtering we use default values for 15 fps cutoff: 2 nyq frequency: 7.5 order of butterworth filter: 4

If you'd like to run the files seperately:

get_gaze.py

The L2CSNet's Pretrained Gaze360 model (https://github.com/Ahmednull/L2CS-Net) was used to get the gaze angles. First we obtain the gaze angles and store them in a seperate CSV file. The parameters needed to obtain the gaze angles are the face video path, front video path (just to get the frame width), config file path and output directory path.

python3 get_gaze.py --face <face_video_file_path> --front <front_video_file_path> --gpu <gpu_id> --config <config_file_path> --output <output_directory_path>

estimatefrontangle.py

The next step is estimating the front angle. This can be done by executing the estimatingfrontangle.py file like so. It needs the gaze angles csv file and an output directory path as arguments.

python3 estimate_front_angle.py --file <gaze_angles_csv> --output <output_directory_path>

filter_csv.py

Since gaze angle estimation is done on a frame by frame basis we filter angles using a butterworth filter. filter_csv.py file takes the arguments of gaze angles, output directory, cutoff frequency, nyq frequency and order of the butterworth filter. As an output it saves a csv file with filtered gaze angles.

``` python3 filtercsv.py <gazeanglescsv> <outputdirectorypath> <cutofffrequency>

```

gaze_overlay.py

gazeoverlayl.py file is used to stich the face and front video side by side. gaze angles from the csv file are taken and transfored to PoG on the front video frames. This file also takes arguments of --objdet and --panseg. if you specify the arguments, the code will perform object detection and segementation respectively, else it will just perform the PoG transformation and save the video output file. It takes the arguments of the face video file, the front video file, output folder path, config file path and optional arguments : panseg, obj_det (give these if you want the funcationality in the output)

``` python3 gazeoverlay.py --face <facevideofilepath> --front --config --outputfolder <outputdirectorypath> --objdet --pan_seg

```

heatmap_overlay.py

The output is generared with heatmaps over the PoG estimates.

python3 heatmap_overlay.py --face <face_video_file_path> --front <front_video_file_path> --config <config_file_path> --output <output_directory_path>

Demo results

output with panoptic segmentation

Yarbus-style visualization

Required Files: Saliency

1.) CSV

Adjusted to Windshield Camera CSV files (detections and gaze location):

e.g., dgf_l2cs_CMVP_0000_0000_10_130218_1429_00079_Face_adjusted_bbox.csv

Frame#  X-value for Gaze    Y-value for Gaze    Object Name

2.) Frames for the video

Directory containing zero-preceeding frame numbers e.g., 000000- 999999