sign-prosody-extraction

Code for the article "Automated Extraction of Prosodic Structure from Unannotated Sign Language Video" (LREC-COLING '24)

https://github.com/agarsev/sign-prosody-extraction

Science Score: 31.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓ CITATION.cff file (found)
✓ codemeta.json file (found)
○ .zenodo.json file
○ DOI references
○ Academic publication links
○ Academic email domains
○ Institutional organization owner
○ JOSS paper metadata
○ Scientific vocabulary similarity: low similarity (14.1%) to scientific vocabulary

Keywords

prosody sign-language sign-language-processing spanish-sign-language

Last synced: 11 months ago

Repository

Basic Info

Host: GitHub
Owner: agarsev
License: eupl-1.2
Language: Jupyter Notebook
Default Branch: main
Homepage:
Size: 604 KB

Statistics

Stars: 0
Watchers: 2
Forks: 0
Open Issues: 0
Releases: 0

Created over 2 years ago · Last pushed over 1 year ago

Metadata Files

Readme License Citation

Automated Extraction of Prosodic Structure from Unannotated Sign Language Video

This repository contains the code for the titular article, presented at LREC-COLING 2024.

Besides the methodology itself, the repository includes a command-line tool that (hopefully) makes it easy to reproduce our experiments, and lets you use our code to extract some information from unannotated video.

This code is free and open source software, licensed under the EUPL.

You can now try it in Colab:

Features

- Command-line interface.
- Install with pip: pip install sign-prosody-extraction
- Plot the prosodic structure (velocity and direction profile of articulators along the temporal axis) of a sign language video.
- Extract thumbnails at "target points": points of low velocity, i.e. high static visual significance.
- Extract a short clip that includes only the sign articulation, ignoring preparation and relaxation. This can help dictionary makers prepare high-significance animated thumbnails of their videos.
- Use either cotracker or mediapipe as outlined in Börstell (2023) for the articulator tracking.
- Export the data as CSV to use it in ELAN or similar.
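
The notion of "target points" mentioned above (points of low velocity) can be illustrated with a minimal sketch. This is not the package's implementation: the function name `find_target_points`, the `threshold` parameter and the sample speed values are all hypothetical, and the rule used here (a local minimum of the speed profile that also falls below a threshold) is only one plausible way to pick such points.

```python
from typing import List


def find_target_points(speed: List[float], threshold: float) -> List[int]:
    """Return frame indices where `speed` has a local minimum below `threshold`.

    Assumption for illustration only: a "target point" is an interior frame
    that is no faster than the previous frame, strictly slower than the next
    one, and slower than `threshold`. The first and last frames are skipped.
    """
    targets = []
    for i in range(1, len(speed) - 1):
        is_local_min = speed[i] <= speed[i - 1] and speed[i] < speed[i + 1]
        if is_local_min and speed[i] < threshold:
            targets.append(i)
    return targets


# Hypothetical per-frame speed of the dominant hand (arbitrary units).
example_speed = [0.1, 0.9, 2.0, 0.8, 0.2, 0.7, 1.8, 0.6, 0.15, 0.4]
print(find_target_points(example_speed, threshold=0.5))  # prints [4, 8]
```

Frames 4 and 8 are the two dips in the example profile; a thumbnail extracted at those frames would show the hand nearly at rest.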

Usage

```bash
pip install sign-prosody-extraction

# Compute everything for VIDEO.mp4, and save in output/
sign-prosody-extraction VIDEO.mp4 --output-dir output/ --everything

# See the different options
sign-prosody-extraction --help
```

You can adjust some further runtime options through environment variables. Set CACHE_DIR, and optionally CACHE_LIMIT, to cache some long computations. To use the mediapipe algorithm, download the pose_landmarker.task model and place it either in the data folder or at a path pointed to by the environment variable POSE_LANDMARKER. The cotracker model is loaded from torchhub.
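
As a rough sketch of how these environment variables might be combined with the command shown earlier, the helper below builds the command line and environment for a scripted run. The helper `build_invocation` and all paths and values are hypothetical. The accepted format of CACHE_LIMIT is not documented here, so it is passed through as an opaque string, and treating POSE_LANDMARKER as the path to the model file itself is an assumption.

```python
import os
from typing import Dict, List, Optional, Tuple


def build_invocation(
    video: str,
    output_dir: str,
    cache_dir: str,
    cache_limit: Optional[str] = None,
    pose_landmarker: Optional[str] = None,
) -> Tuple[List[str], Dict[str, str]]:
    """Build the CLI command and environment for one run (nothing is executed)."""
    env = dict(os.environ)            # start from the current environment
    env["CACHE_DIR"] = cache_dir      # where long computations are cached
    if cache_limit is not None:
        env["CACHE_LIMIT"] = cache_limit  # value format is an assumption
    if pose_landmarker is not None:
        # Assumed to be the path of the downloaded pose_landmarker.task file.
        env["POSE_LANDMARKER"] = pose_landmarker
    cmd = [
        "sign-prosody-extraction", video,
        "--output-dir", output_dir,
        "--everything",
    ]
    return cmd, env


cmd, env = build_invocation(
    "VIDEO.mp4", "output/", cache_dir="/tmp/spe-cache",
    pose_landmarker="/models/pose_landmarker.task",
)
print(" ".join(cmd))
```

The returned pair can then be handed to `subprocess.run(cmd, env=env, check=True)` once the package is installed.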

Article Abstract

As in oral phonology, prosody is an important carrier of linguistic information in sign languages. One of the most prominent ways this reveals itself is in the time structure of signs: their rhythm and intensity of articulation. To be able to empirically see these effects, the velocity of the hands can be computed throughout the execution of a sign. In this article, we propose a method for extracting this information from unlabeled videos of sign language, exploiting CoTracker, a recent advancement in computer vision which can track every point in a video without the need of any calibration or fine-tuning. The dominant hand is identified via clustering of the computed point velocities, and its dynamic profile plotted to make apparent the prosodic structure of signing. We apply our method to different datasets and sign languages, and perform a preliminary visual exploration of results. This exploration supports the usefulness of our methodology for linguistic analysis, though issues to be tackled remain, such as bi-manual signs and a formal and numerical evaluation of accuracy. Nonetheless, the absence of any preprocessing requirements may make it useful for other researchers and datasets.
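
To make the idea in the abstract concrete, here is a deliberately simplified sketch: compute per-point speeds from tracked trajectories, separate fast-moving points from the rest, and average the fast group into a velocity profile. It is not the article's method. The real pipeline obtains the tracks from CoTracker and its clustering is not reproduced here; the two-cluster split on mean speed, the function names and the toy trajectories are all assumptions made for illustration.

```python
import math
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def speeds(track: Sequence[Point], fps: float) -> List[float]:
    """Speed between consecutive frames, in pixels per second."""
    return [math.dist(a, b) * fps for a, b in zip(track, track[1:])]


def mean(values: Sequence[float]) -> float:
    return sum(values) / len(values)


def fastest_cluster(tracks: Sequence[Sequence[Point]], fps: float,
                    iterations: int = 20) -> List[int]:
    """Two-cluster 1-D k-means on mean speed; returns indices of the faster group.

    Each track needs at least two frames. If every track moves at the same
    mean speed, no group is singled out and an empty list is returned.
    """
    avg = [mean(speeds(t, fps)) for t in tracks]
    lo, hi = min(avg), max(avg)  # initial cluster centres
    fast: List[int] = []
    for _ in range(iterations):
        fast = [i for i, v in enumerate(avg) if abs(v - hi) < abs(v - lo)]
        slow = [i for i in range(len(avg)) if i not in fast]
        if not fast or not slow:
            break
        lo = mean([avg[i] for i in slow])
        hi = mean([avg[i] for i in fast])
    return fast


def velocity_profile(tracks: Sequence[Sequence[Point]],
                     members: Sequence[int], fps: float) -> List[float]:
    """Per-frame speed averaged over the selected tracks."""
    per_track = [speeds(tracks[i], fps) for i in members]
    return [mean(frame) for frame in zip(*per_track)]


# Toy trajectories: two near-static background points, two points on a moving hand.
tracks = [
    [(10, 10)] * 5,
    [(50, 50), (50, 51), (50, 50), (50, 51), (50, 50)],
    [(100, 100), (110, 100), (125, 100), (135, 100), (140, 100)],
    [(102, 98), (112, 98), (127, 98), (137, 98), (142, 98)],
]
hand = fastest_cluster(tracks, fps=25.0)
print(hand)                                    # indices of the fast-moving points
print(velocity_profile(tracks, hand, fps=25.0))
```

In this toy input the last two tracks are selected, and their averaged profile rises and then decays, which is the kind of curve the plots are meant to expose.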

Read more at the LREC-COLING 2024 proceedings

How to cite

If you use our code, please do cite us! Also don't hesitate to get in touch.

@inproceedings{sevilla-etal-2024-automated-extraction,
    title = "Automated Extraction of Prosodic Structure from Unannotated Sign Language Video",
    author = "Sevilla, Antonio F. G.  and
      Lahoz-Bengoechea, Jos{\'e} Mar{\'\i}a  and
      Diaz, Alberto",
    editor = "Calzolari, Nicoletta  and
      Kan, Min-Yen  and
      Hoste, Veronique  and
      Lenci, Alessandro  and
      Sakti, Sakriani  and
      Xue, Nianwen",
    booktitle = "Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)",
    month = may,
    year = "2024",
    address = "Torino, Italia",
    publisher = "ELRA and ICCL",
    url = "https://aclanthology.org/2024.lrec-main.161",
    pages = "1808--1816",
}

Authors

Antonio F. G. Sevilla (https://github.com/agarsev)

José María Lahoz Bengoechea (https://github.com/jmlahoz)

Alberto Díaz Esteban

Owner

Name: Antonio F. García Sevilla
Login: agarsev
Kind: user

Website: https://garciasevilla.com
Repositories: 2
Profile: https://github.com/agarsev

Hi! My name is Antonio, computational linguist and engineer.

Citation (CITATION.bib)

@inproceedings{sevilla-etal-2024-automated-extraction,
    title = "Automated Extraction of Prosodic Structure from Unannotated Sign Language Video",
    author = "Sevilla, Antonio F. G.  and
      Lahoz-Bengoechea, Jos{\'e} Mar{\'\i}a  and
      Diaz, Alberto",
    editor = "Calzolari, Nicoletta  and
      Kan, Min-Yen  and
      Hoste, Veronique  and
      Lenci, Alessandro  and
      Sakti, Sakriani  and
      Xue, Nianwen",
    booktitle = "Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)",
    month = may,
    year = "2024",
    address = "Torino, Italia",
    publisher = "ELRA and ICCL",
    url = "https://aclanthology.org/2024.lrec-main.161",
    pages = "1808--1816",
    abstract = "As in oral phonology, prosody is an important carrier of linguistic information in sign languages. One of the most prominent ways this reveals itself is in the time structure of signs: their rhythm and intensity of articulation. To be able to empirically see these effects, the velocity of the hands can be computed throughout the execution of a sign. In this article, we propose a method for extracting this information from unlabeled videos of sign language, exploiting CoTracker, a recent advancement in computer vision which can track every point in a video without the need of any calibration or fine-tuning. The dominant hand is identified via clustering of the computed point velocities, and its dynamic profile plotted to make apparent the prosodic structure of signing. We apply our method to different datasets and sign languages, and perform a preliminary visual exploration of results. This exploration supports the usefulness of our methodology for linguistic analysis, though issues to be tackled remain, such as bi-manual signs and a formal and numerical evaluation of accuracy. Nonetheless, the absence of any preprocessing requirements may make it useful for other researchers and datasets.",
}

GitHub Events

Total

Release event: 1
Push event: 9
Create event: 1

Last Year

Release event: 1
Push event: 9
Create event: 1

Packages

Total packages: 1
Total downloads:
- pypi 4 last-month

Total dependent packages: 0
Total dependent repositories: 0
Total versions: 4
Total maintainers: 1

pypi.org: sign-prosody-extraction

Code and command-line tool for "Automated Extraction of Prosodic Structure from Unannotated Sign Language Video" (Sevilla et al., 2024).

Documentation: https://sign-prosody-extraction.readthedocs.io/
License: EUPL-1.2
Latest release: 24.12
published over 1 year ago

Versions: 4
Dependent Packages: 0
Dependent Repositories: 0
Downloads: 4 Last month

Rankings

Dependent packages count: 9.4%

Average: 35.7%

Dependent repos count: 62.0%

Maintainers (1)

agarsev
