nodding-pigeon

Detection and classification of head gestures in videos

https://github.com/bhky/nodding-pigeon

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (14.1%) to scientific vocabulary

Keywords

gesture-classification gesture-detection keras machine-learning tensorflow2
Last synced: 6 months ago

Repository

Detection and classification of head gestures in videos

Basic Info
  • Host: GitHub
  • Owner: bhky
  • License: mit
  • Language: Python
  • Default Branch: main
  • Homepage:
  • Size: 2.55 MB
Statistics
  • Stars: 17
  • Watchers: 1
  • Forks: 3
  • Open Issues: 0
  • Releases: 10
Topics
gesture-classification gesture-detection keras machine-learning tensorflow2
Created almost 4 years ago · Last pushed about 1 year ago
Metadata Files
Readme License Citation

README.md


Introduction

The Nodding Pigeon library provides a pre-trained model and a simple inference API for detecting head gestures in short videos. Under the hood, it uses Google MediaPipe for collecting the landmark features.

Installation

Tested for Python 3.8, 3.9, and 3.10.

The best way to install this library with its dependencies is from PyPI:

```shell
python3 -m pip install --upgrade noddingpigeon
```

Alternatively, to obtain the latest version from this repository:

```shell
git clone git@github.com:bhky/nodding-pigeon.git
cd nodding-pigeon
python3 -m pip install .
```

Usage

An easy way to try the API and the pre-trained model is to make a short video with your head gesture.

Webcam

The code snippet below will perform the following:
- Search for the pre-trained weights file in `$HOME/.noddingpigeon/weights/`; if it does not exist, the file will be downloaded from this repository.
- Start the webcam.
- Collect the needed number of frames (default 60) for the model.
- End the webcam automatically (or you can press `q` to end earlier).
- Make a prediction of your head gesture and print the result to STDOUT.

```python
from noddingpigeon.inference import predict_video

result = predict_video()
print(result)
```

Example result:

```text
{'gesture': 'nodding',
 'probabilities': {'has_motion': 1.0,
                   'gestures': {'nodding': 0.9576354622840881,
                                'turning': 0.042364541441202164}}}
```

Video file

Alternatively, you could provide a pre-recorded video file:

```python
from noddingpigeon.inference import predict_video
from noddingpigeon.video import VideoSegment  # Optional.

result = predict_video(
    "your_head_gesture_video.mp4",
    video_segment=VideoSegment.LAST,
    # Optionally change these parameters:
    motion_threshold=0.5,
    gesture_threshold=0.9
)
```

Note that no matter how long your video is, only the pre-defined number of frames (60 for the current model) are used for prediction. The `video_segment` enum option controls how the frames are obtained from the video, e.g., `VideoSegment.LAST` means the last (60) frames will be used.

Thresholds can be adjusted as needed; see the explanation in the head gestures section.

Result format

The result is returned as a Python dictionary.

```text
{
  'gesture': 'turning',
  'probabilities': {
    'has_motion': 1.0,
    'gestures': {
      'nodding': 0.009188028052449226,
      'turning': 0.9908120036125183
    }
  }
}
```
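As a small illustration of consuming this dictionary, the sketch below extracts the predicted label and its probability; the key names follow the format above, and `result` here is a hypothetical pre-computed value:

```python
# Hypothetical result in the documented format.
result = {
    'gesture': 'turning',
    'probabilities': {
        'has_motion': 1.0,
        'gestures': {'nodding': 0.0092, 'turning': 0.9908}
    }
}

gesture = result['gesture']
# 'gestures' may be empty, e.g., for an 'undefined' result with no landmarks.
confidence = result['probabilities'].get('gestures', {}).get(gesture)
print(gesture, confidence)  # turning 0.9908
```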

Head gestures

The following gesture types are available:
- `nodding` - Repeatedly tilt your head upward and downward.
- `turning` - Repeatedly turn your head leftward and rightward.
- `stationary` - Not tilting or turning your head; translational motion is still treated as stationary.
- `undefined` - Unrecognised gesture, or no landmarks detected (usually means no face is shown).

To determine the final gesture:
- If the `has_motion` probability is smaller than `motion_threshold` (default 0.5), the gesture is `stationary`; the other probabilities are irrelevant.
- Otherwise, the largest probability from `gestures` is considered:
  - If it is smaller than `gesture_threshold` (default 0.9), the gesture is `undefined`;
  - else, the corresponding gesture label is selected (e.g., `nodding`).
- If no landmarks are detected in the video, the gesture is `undefined` and the `probabilities` dictionary is empty.
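The rules above can be paraphrased as a short Python sketch; this mirrors the documented behaviour rather than the library's internal code, and the function name `select_gesture` is hypothetical:

```python
def select_gesture(has_motion: float, gesture_probs: dict,
                   motion_threshold: float = 0.5,
                   gesture_threshold: float = 0.9) -> str:
    if not gesture_probs:          # No landmarks detected in the video.
        return "undefined"
    if has_motion < motion_threshold:
        return "stationary"        # Other probabilities are irrelevant.
    best = max(gesture_probs, key=gesture_probs.get)
    # Only accept the top gesture if it is confident enough.
    return best if gesture_probs[best] >= gesture_threshold else "undefined"

print(select_gesture(1.0, {"nodding": 0.96, "turning": 0.04}))  # nodding
```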

API

noddingpigeon.inference

predict_video

Detect the head gesture shown in the input video, either from a webcam or a file.
- Parameters:
  - `video_path` (`Optional[str]`, default `None`): File path to the video file, or `None` for starting a webcam.
  - `model` (`Optional[tf.keras.Model]`, default `None`): A TensorFlow-Keras model instance, or `None` for using the default model.
  - `max_num_frames` (`int`, default 60): Maximum number of frames to be processed by the model. Do not change this when using the default model.
  - `video_segment` (`VideoSegment` enum, default `VideoSegment.BEGINNING`): See the explanation of `VideoSegment`.
  - `end_padding` (`bool`, default `True`): If `True` and `max_num_frames` is set, when the input video does not have enough frames to form the feature tensor for the model, padding at the end is done using the features detected on the last frame.
  - `drop_consecutive_duplicates` (`bool`, default `True`): If `True`, features from a frame are not used to form the feature tensor when they are considered the same as those of the previous frame. This is a mechanism to guard against "fake" videos created from static images.
  - `postprocessing` (`bool`, default `True`): If `True`, the final result is presented as the Python dictionary described in the usage section; otherwise the raw model output is returned.
  - `motion_threshold` (`float`, default 0.5): See the head gestures section.
  - `gesture_threshold` (`float`, default 0.9): See the head gestures section.
- Return:
  - A Python dictionary if `postprocessing` is `True`, otherwise `List[float]` from the model output.
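For example, a minimal sketch combining several of the options above; only parameters documented in this section are used, and the file name is a placeholder:

```python
from noddingpigeon.inference import predict_video
from noddingpigeon.video import VideoSegment

# Return the raw model output for the last 60 frames of a file.
raw_output = predict_video(
    "clip.mp4",                        # Placeholder file name.
    video_segment=VideoSegment.LAST,
    end_padding=True,                  # Pad short videos with last-frame features.
    drop_consecutive_duplicates=True,  # Skip frames identical to the previous one.
    postprocessing=False,              # Return List[float] instead of a dict.
)
```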

noddingpigeon.video

VideoSegment

Enum class for video segment options.
- `VideoSegment.BEGINNING`: Collect the required frames for the model from the beginning of the video.
- `VideoSegment.LAST`: Collect the required frames for the model toward the end of the video.

noddingpigeon.model

make_model

Create an instance of the model used in this library, optionally with pre-trained weights loaded.
- Parameters:
  - `weights_path` (`Optional[str]`, default `$HOME/.noddingpigeon/weights/*.h5`): Path to the weights in HDF5 format to be loaded by the model. The weights file will be downloaded if it does not exist. If `None`, no weights will be downloaded or loaded into the model. Users can provide a path if the default is not preferred. The environment variable `NODDING_PIGEON_HOME` can also be used to indicate where the `.noddingpigeon/` directory should be located.
- Return:
  - A `tf.keras.Model` object.
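For instance, a small sketch of loading the model once and reusing it across predictions; this only combines the parameters documented above, and the file name is a placeholder:

```python
from noddingpigeon.model import make_model
from noddingpigeon.inference import predict_video

model = make_model()  # Downloads the default weights on first use.
result = predict_video("clip.mp4", model=model)  # Placeholder file name.
```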

Model training

Brief procedure:
- Record a few long-ish videos: one for each head gesture done repeatedly with as many variations as possible, and one for stationary.
- Collect the landmark features in the videos using MediaPipe.
- During model training, random sub-sequences of the feature collection, corresponding to different video segments and gestures, are generated as training samples (see the sketch after this list).
  - This basically means that the samples generated in each epoch are very likely all different from each other, which also serves as a good regularization.
- A very simple 1D-convolutional model architecture is used to minimise overfitting.
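As a rough illustration of that sub-sequence sampling idea (a hypothetical sketch, not the actual training script; the feature shape and the 60-frame window are assumptions based on the model input described earlier):

```python
import numpy as np

def sample_window(features: np.ndarray, window: int = 60) -> np.ndarray:
    # `features` is assumed shaped (num_frames, num_landmark_features),
    # collected from one long recorded video of a single gesture.
    start = np.random.randint(0, len(features) - window + 1)
    return features[start:start + window]

# Fresh random windows each epoch make repeated samples very unlikely,
# which acts as a form of regularization.
video_features = np.random.rand(1000, 12)  # Placeholder feature array.
training_sample = sample_window(video_features)
```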

For details, see the data collection and model training scripts in the training directory.

Owner

  • Name: Bosco Yung
  • Login: bhky
  • Kind: user

Machine Learning Engineer

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it using the following metadata."
title: "Nodding Pigeon"
version: "0.6.0"
url: "https://github.com/bhky/nodding-pigeon"
license: "MIT"
authors:
  - family-names: "Yung"
    given-names: "Bosco"
    orcid: "https://orcid.org/0000-0002-3776-1589"
date-released: "2022-05-13"

GitHub Events

Total
  • Watch event: 2
  • Push event: 1
Last Year
  • Watch event: 2
  • Push event: 1

Committers

Last synced: over 1 year ago

All Time
  • Total Commits: 136
  • Total Committers: 1
  • Avg Commits per committer: 136.0
  • Development Distribution Score (DDS): 0.0
Past Year
  • Commits: 5
  • Committers: 1
  • Avg Commits per committer: 5.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Bosco Yung 1****y 136

Issues and Pull Requests

Last synced: 7 months ago

All Time
  • Total issues: 1
  • Total pull requests: 4
  • Average time to close issues: 1 day
  • Average time to close pull requests: 3 days
  • Total issue authors: 1
  • Total pull request authors: 2
  • Average comments per issue: 2.0
  • Average comments per pull request: 0.5
  • Merged pull requests: 1
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • sasakits (1)
Pull Request Authors
  • bhky (3)
  • katsunori-waragai (2)
Top Labels
Issue Labels
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 15 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 1
  • Total versions: 2
  • Total maintainers: 1
pypi.org: noddingpigeon

Detection and classification of head gestures in videos

  • Versions: 2
  • Dependent Packages: 0
  • Dependent Repositories: 1
  • Downloads: 15 Last month
Rankings
Dependent packages count: 10.1%
Stargazers count: 16.1%
Dependent repos count: 21.6%
Forks count: 22.7%
Average: 24.3%
Downloads: 51.0%
Maintainers (1)
Last synced: 6 months ago

Dependencies

requirements.txt pypi
  • gdown >=4.2.0
  • mediapipe >=0.8.9.1
  • numpy >=1.22.0
  • opencv-python >=4.0.0.0
  • tensorflow >=2.7.0
.github/workflows/ci.yml actions
  • actions/checkout main composite
  • actions/setup-python main composite
pyproject.toml pypi