sign-language-recognition-model

https://github.com/dudu197/sign-language-recognition-model

Science Score: 54.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
✓
Academic publication links
Links to: arxiv.org, sciencedirect.com
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (13.0%) to scientific vocabulary

Last synced: 10 months ago · JSON representation ·

Repository

Basic Info

Host: GitHub
Owner: Dudu197
Language: Python
Default Branch: main
Size: 19.5 KB

Statistics

Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 0

Created over 1 year ago · Last pushed 12 months ago

Metadata Files

Readme Citation

Sign Language Recognition Model

Enhancing Brazilian Sign Language Recognition through Skeleton Image Representation

Overview

This repository contains a simplified version of the code for training a Sign Language Recognition (SLR) model based on skeleton images, as described in the paper:

Alves, Carlos Eduardo GR, Francisco de Assis Boldt, and Thiago M. Paixão. "Enhancing Brazilian Sign Language Recognition through Skeleton Image Representation." arXiv preprint arXiv:2404.19148 (2024).

This version is designed for quick-start experiments and user-friendliness. For the full experimental codebase, see the original repository.

Overview
Requirements & Installation
Dataset
Directory Structure
Training
Results
Citation
License

Requirements & Installation

Python 3.7+
All required libraries are listed in requirements.txt.

Install dependencies: bash pip install -r requirements.txt

Dataset

This code was tested using the MINDS-Libras dataset, but can be adapted for other sign language datasets as well:

MINDS-Libras

20 signs, 12 signers, 5 repetitions per sign.
Download preprocessed MINDS-Libras data
Place the dataset in the datasets folder.

Libras-UFOP

56 signs, 5 signers, 8–16 repetitions per sign.
Paper
Preprocessed data may be available in the original repository or by request.

Include-50

50 signs, larger and more diverse dataset.
Download preprocessed Include-50 data
Place the dataset in the datasets folder.

KSL (Korean Sign Language)

Korean Sign Language dataset.
Download preprocessed KSL data
Place the dataset in the datasets folder.

Note: This repository is primarily set up for MINDS-Libras, but you can adapt the code for other datasets by adjusting file paths and preprocessing as needed.

Directory Structure

datasets/ – Place your dataset files here.
image_representations/ – Skeleton image representation code.
models/ – Model definitions (e.g., ResNet18).
results/ – Results and logs will be saved here.
model_training.py – Main training script.
train_minds.py – Example training script for MINDS-Libras.
train.sh – Shell script to run training.

Training

To train the model using the MINDS-Libras dataset, run:

bash sh train.sh

This will train the model and save results in the results folder.

For more details about how to train, check https://github.com/Dudu197/sign-language-recognition/blob/main/03modeltraining/README.md

Results

Our model achieves strong performance on multiple sign language datasets:

MINDS-Libras:
- Accuracy: ~0.93
- +2 percentage points accuracy, +3 F1-Score over previous SOTA
Libras-UFOP:
- Accuracy: ~0.82
- +8 percentage points accuracy, +9 F1-Score over previous SOTA
Include-50:
- Accuracy: ~0.97 (ResNet18 + Skeleton-DML)
KSL (Korean Sign Language):
- Accuracy: ~0.63 (ResNet18 + Skeleton-DML)

For more results and details, see the original paper.

Citation

If you use this code for your research, please cite our paper:

Alves, Carlos Eduardo GR, Francisco de Assis Boldt, and Thiago M. Paixão. "Enhancing Brazilian Sign Language Recognition through Skeleton Image Representation." arXiv preprint arXiv:2404.19148 (2024).

BibTeX: bibtex @article{alves2024enhancing, title={Enhancing Brazilian Sign Language Recognition through Skeleton Image Representation}, author={Alves, Carlos Eduardo GR and Boldt, Francisco de Assis and Paix{a}o, Thiago M}, journal={arXiv preprint arXiv:2404.19148}, year={2024} }

License

This project is licensed under the terms of the MIT License.

Owner

Name: Carlos Eduardo
Login: Dudu197
Kind: user
Location: Volta Redonda, RJ, Brasil
Company: Dudollar

Website: https://dudollar.com.br
Repositories: 6
Profile: https://github.com/Dudu197

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: Alves
    given-names: Carlos Eduardo Gomes Reddo
  - family-names: M Paixão
    given-names: Thiago
title: "Enhancing Brazilian Sign Language Recognition Through Skeleton Image Representation"
version: 1.0.0
identifiers:
  - type: doi
    value: 10.1109/SIBGRAPI62404.2024.10716301
date-released: 2024-10-18

GitHub Events

Total

Push event: 4

Last Year

Push event: 4

Dependencies

requirements.txt pypi

matplotlib *
numpy *
opencv-python *
pandas *
pillow *
scikit-learn ==1.3.0
torch ==2.0.1
torchvision ==0.15.2

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science