sign-language-recognition-model
Science Score: 54.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
✓Academic publication links
Links to: arxiv.org, sciencedirect.com -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (13.0%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: Dudu197
- Language: Python
- Default Branch: main
- Size: 19.5 KB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
Sign Language Recognition Model
Enhancing Brazilian Sign Language Recognition through Skeleton Image Representation
Overview
This repository contains a simplified version of the code for training a Sign Language Recognition (SLR) model based on skeleton images, as described in the paper:
Alves, Carlos Eduardo GR, Francisco de Assis Boldt, and Thiago M. Paixão. "Enhancing Brazilian Sign Language Recognition through Skeleton Image Representation." arXiv preprint arXiv:2404.19148 (2024).
This version is designed for quick-start experiments and user-friendliness. For the full experimental codebase, see the original repository.
Table of Contents
Requirements & Installation
- Python 3.7+
- All required libraries are listed in
requirements.txt.
Install dependencies:
bash
pip install -r requirements.txt
Dataset
This code was tested using the MINDS-Libras dataset, but can be adapted for other sign language datasets as well:
MINDS-Libras
- 20 signs, 12 signers, 5 repetitions per sign.
- Download preprocessed MINDS-Libras data
- Place the dataset in the
datasetsfolder.
Libras-UFOP
- 56 signs, 5 signers, 8–16 repetitions per sign.
- Paper
- Preprocessed data may be available in the original repository or by request.
Include-50
- 50 signs, larger and more diverse dataset.
- Download preprocessed Include-50 data
- Place the dataset in the
datasetsfolder.
KSL (Korean Sign Language)
- Korean Sign Language dataset.
- Download preprocessed KSL data
- Place the dataset in the
datasetsfolder.
Note: This repository is primarily set up for MINDS-Libras, but you can adapt the code for other datasets by adjusting file paths and preprocessing as needed.
Directory Structure
datasets/– Place your dataset files here.image_representations/– Skeleton image representation code.models/– Model definitions (e.g., ResNet18).results/– Results and logs will be saved here.model_training.py– Main training script.train_minds.py– Example training script for MINDS-Libras.train.sh– Shell script to run training.
Training
To train the model using the MINDS-Libras dataset, run:
bash
sh train.sh
This will train the model and save results in the results folder.
For more details about how to train, check https://github.com/Dudu197/sign-language-recognition/blob/main/03modeltraining/README.md
Results
Our model achieves strong performance on multiple sign language datasets:
MINDS-Libras:
- Accuracy: ~0.93
- +2 percentage points accuracy, +3 F1-Score over previous SOTA
Libras-UFOP:
- Accuracy: ~0.82
- +8 percentage points accuracy, +9 F1-Score over previous SOTA
Include-50:
- Accuracy: ~0.97 (ResNet18 + Skeleton-DML)
KSL (Korean Sign Language):
- Accuracy: ~0.63 (ResNet18 + Skeleton-DML)
For more results and details, see the original paper.
Citation
If you use this code for your research, please cite our paper:
Alves, Carlos Eduardo GR, Francisco de Assis Boldt, and Thiago M. Paixão. "Enhancing Brazilian Sign Language Recognition through Skeleton Image Representation." arXiv preprint arXiv:2404.19148 (2024).
BibTeX:
bibtex
@article{alves2024enhancing,
title={Enhancing Brazilian Sign Language Recognition through Skeleton Image Representation},
author={Alves, Carlos Eduardo GR and Boldt, Francisco de Assis and Paix{a}o, Thiago M},
journal={arXiv preprint arXiv:2404.19148},
year={2024}
}
License
This project is licensed under the terms of the MIT License.
Owner
- Name: Carlos Eduardo
- Login: Dudu197
- Kind: user
- Location: Volta Redonda, RJ, Brasil
- Company: Dudollar
- Website: https://dudollar.com.br
- Repositories: 6
- Profile: https://github.com/Dudu197
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: Alves
given-names: Carlos Eduardo Gomes Reddo
- family-names: M Paixão
given-names: Thiago
title: "Enhancing Brazilian Sign Language Recognition Through Skeleton Image Representation"
version: 1.0.0
identifiers:
- type: doi
value: 10.1109/SIBGRAPI62404.2024.10716301
date-released: 2024-10-18
GitHub Events
Total
- Push event: 4
Last Year
- Push event: 4
Dependencies
- matplotlib *
- numpy *
- opencv-python *
- pandas *
- pillow *
- scikit-learn ==1.3.0
- torch ==2.0.1
- torchvision ==0.15.2