transforemotion

Sentiment Analysis for Text, Image and Video Using Transformer Models

https://github.com/atomashevic/transforemotion

Science Score: 67.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
✓
DOI references
Found 1 DOI reference(s) in README
✓
Academic publication links
Links to: arxiv.org, zenodo.org
○
Committers with academic emails
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (15.7%) to scientific vocabulary

Last synced: 6 months ago · JSON representation ·

Repository

Sentiment Analysis for Text, Image and Video Using Transformer Models

Basic Info

Host: GitHub
Owner: atomashevic
License: gpl-3.0
Language: R
Default Branch: main
Homepage:
Size: 169 MB

Statistics

Stars: 34
Watchers: 4
Forks: 3
Open Issues: 1
Releases: 0

Created about 4 years ago · Last pushed 6 months ago

Metadata Files

Readme Changelog License Citation

CRAN 0.1.6 | GitHub 0.1.6

transforEmotion: Sentiment Analysis for Text, Image and Video Using Transformer Models

With transforEmotion you can use cutting-edge transformer models for zero-shot emotion classification of text, image, and video in R, all without the need for a GPU, subscriptions, paid services, or using Python. All data is processed locally on your machine, and nothing is sent to any external server or third-party service. This ensures full privacy for your data.

How to install the package?
How to run sentiment analysis on text?
How to run facial expression recognition on images?
How to run facial expression recognition on videos?

How to Install

You can find the latest stable version on CRAN. Install it in R with:

R install.packages("transforEmotion")

If you want to use the latest development version, you can install it from GitHub using the devtools package.

```R if(!"devtools" %in% row.names(installed.packages())){ install.packages("devtools") }

devtools::install_github("atomashevic/transforEmotion") ```

After installing the package, load it in R.

```R

Load package

library(transforEmotion) ```

After loading package for the first time, you need to setup the Python virtual environment. This will download the necessary Python packages and models. This step can take a few minutes but it is only required once after installing the package on a new system.

```R

Run Python setup

setup_miniconda() ```

You will be prompted to install GPU libraries. If you have an NVIDIA GPU, select "[Y]es" to install GPU libraries. If you don't have an NVIDIA GPU, select "[N]o" to proceed with CPU-only installation.

You will gain access to all functionalities of the package even without GPU, but be aware that some functions will be significantly slower.

If you have doubts whether you should install GPU libraries, see GPU Support section below.

[!WARNING] If you using radian console in VSCode or in a terminal emulator, you won't be able to set up the transforEmotion package. Radian is written in Python and (in most cases) already runs in your default Python environment. This prevents transforEmotion package from setting up the new virtual environment and installing the correct versions of necessary Python packages. Switch to default R console and everything should work fine.

Text Example

Next load some data with text for analysis. The example below uses item descriptions from the personality trait extraversion in the NEO-PI-R inventory found on the IPIP website.

```R

Load data

data(neoipipextraversion) ```

For the example, the positively worded item descriptions will be used.

```R

Example text

text <- neoipipextraversion$friendliness[1:5] ```

Next, the text can be loaded in the function transformer_scores() to obtain the probability that item descriptions correspond to a certain class. The classes defined below are the facets of extraversion in the NEO-PI-R. The example text data draws from the friendliness facet.

```R

Cross-Encoder DistilRoBERTa

transformer_scores( text = text, classes = c( "friendly", "gregarious", "assertive", "active", "excitement", "cheerful" ) ) ```

The default transformer model is DistilRoBERTa. The model is fast and accurate.

BART

Another model that can be used is BART, a much larger and more computationally intensive model (slower prediction times). The BART model tends to be more accurate but the accuracy gains above DistilRoBERTa are negotiatiable.

```R

Facebook BART Large

transformer_scores( text = text, classes = c( "friendly", "gregarious", "assertive", "active", "excitement", "cheerful" ), transformer = "facebook-bart" ) ```

Any Text Classification Model with a Pipeline on huggingface

Text classification models with a pipeline on huggingface can be used so long as there is a pipeline available for them. Below is an example of Typeform's DistilBERT model.

```R

Directly from huggingface: typeform/distilbert-base-uncased-mnli

transformer_scores( text = text, classes = c( "friendly", "gregarious", "assertive", "active", "excitement", "cheerful" ), transformer = "typeform/distilbert-base-uncased-mnli" ) ```

RAG

The rag function is designed to enhance text generation using Retrieval-Augmented Generation (RAG) techniques. This function allows users to input text data or specify a path to local PDF files, which are then used to retrieve relevant documents.

The rag function supports various large language models (LLMs), including TinyLLAMA, LLAMA-2, Mistral-7B, Orca-2, and Phi-2, each offering different levels of computational efficiency and quality. The default model is TinyLLAMA, which is the fastest model.

Here's an example based on the decription of this package. First, we specify the text data.

R text <- "With `transforEmotion` you can use cutting-edge transformer models for zero-shot emotion classification of text, image, and video in R, *all without the need for a GPU, subscriptions, paid services, or using Python. Implements sentiment analysis using [huggingface](https://huggingface.co/) transformer zero-shot classification model pipelines. The default pipeline for text is [Cross-Encoder's DistilRoBERTa](https://huggingface.co/cross-encoder/nli-distilroberta-base) trained on the [Stanford Natural Language Inference](https://huggingface.co/datasets/snli) (SNLI) and [Multi-Genre Natural Language Inference](https://huggingface.co/datasets/multi_nli) (MultiNLI) datasets. Using similar models, zero-shot classification transformers have demonstrated superior performance relative to other natural language processing models (Yin, Hay, & Roth, [2019](https://arxiv.org/abs/1909.00161)). All other zero-shot classification model pipelines can be implemented using their model name from https://huggingface.co/models?pipeline_tag=zero-shot-classification."

And then we run the rag function.

R rag(text, query = "What is the use case for transforEmotion package?")

This code will provide the output similar to this one.

The use case for transforEmotion package is to use cutting-edge transformer models forzero-shot emotion classification of text, image, and video in R, without the need for a GPU, subscriptions, paid services, or using Python. This package implements sentiment analysis using the Cross-Encoder's DistilRoBERTa model trained on the Stanford Natural Language Inference (SNLI) and MultiNLI datasets. Using similar models, zero-shot classification transformers have demonstrated superior performance relative to other natural language processing models (Yin, Hay, & Roth, [2019](https://arxiv.org/abs/1909.00161)). The transforEmotion package can be used to implement these models and other zero-shot classification model pipelines from the HuggingFace library.>

Image Example

For Facial Expression Recognition (FER) task from images we use Open AI's CLIP transformer model. Two input arguments are needed: the path to image and list of emotion labels.

Path can be either local or an URL. Here's an example of using a URL of Mona Lisa's image from Wikipedia.

```R

Image URL or local filepath

image <- 'https://cdn.mos.cms.futurecdn.net/xRqbwS4odpkSQscn3jHECh-650-80.jpg'

Array of emotion labels

emotions <- c("excitement", "happiness", "pride", "anger", "fear", "sadness", "neutral")

Run FER with base model

image_scores(image, emotions, model = "oai-base") ```

You can define up to 10 emotions. The output is a data frame with 1 row and columns corresponding to emotions. The values are FER scores for each emotion.

If there is no face detected in the image, the output will be a 0x0 data frame.

If there are multiple faces detected in the image, by default the function will return the FER scores for the largest (focal) face. Alternatively, you can select the face on the left or the right side of the image by specifying the face_selection argument.

Video Example

Video processing works by extracting frames from the video and then running the image processing function on each frame. Two input arguments are needed: the path to video and list of emotion labels.

Path can be either a local filepath or a YouTube URL. Support for other video hosting platforms is not yet implemented.

```R

Video URL or local filepath

videourl <- "https://www.youtube.com/watch?v=hdYNcv-chgY&abchannel=Conservatives"

Array of emotion labels

emotions <- c("excitement", "happiness", "pride", "anger", "fear", "sadness", "neutral")

Run FER on `nframes` of the video with large model

result <- videoscores(videourl, classes = emotions, nframes = 10, savevideo = TRUE, saveframes = TRUE, video_name = 'boris-johnson', start = 10, end = 120, model = "oai-large") ```

Working with videos is more computationally complex. This example extracts only 10 frames from the video and shouldn't take longer than a few minutes on an average laptop without GPU (depending on your internet connection needed to download the entire video and CLIP model). In research applications, we will usually extract 100-300 frames from the video. This can take much longer, so patience is advised while waiting for the results.

Available Models

The image_scores and video_scores functions support different models. The available models are:

oai-base: "openai/clip-vit-base-patch32" - A base model that is faster but less accurate. Requires ~2GB of RAM.
oai-large: "openai/clip-vit-large-patch14" - A larger model that is more accurate but slower. Requires ~4GB of RAM.
eva-8B: "BAAI/EVA-CLIP-8B-448" - A very large model that has been quantized to 4-bit precision for reduced memory usage (requires ~8GB of RAM instead of the original ~32GB).
jina-v2: "jinaai/jina-clip-v2" - Another large model with high accuracy but requires more resources (~6GB of RAM).

Note: The memory requirements listed above are approximate and represent the minimum RAM needed. For optimal performance, we recommend having at least 16GB of system RAM when using any of these models. If you're processing videos or multiple images in batch, more RAM might be needed. When using GPU acceleration, similar VRAM requirements apply. We recommend using 'oai-base' or 'oai-large' for most applications as they provide a good balance between accuracy and resource usage.

GPU Support

When running the setup_miniconda() function, you will be prompted to install GPU libraries. If you select "[Y]es" when prompted to install GPU libraries, make sure you have:

An NVIDIA GPU (GTX 1060 or newer)
CUDA Toolkit 11.7+ installed
Updated NVIDIA drivers
GCC/G++ version 9 or newer (Linux only)

Without these requirements, the GPU installation will likely fail. If you're unsure, select "no" to proceed with CPU-only installation.

If GPU installation fails, you can try running the setup_modules() function and selection "[N]o" when prompted to install GPU libraries.

References

BART

Lewis, M., Liu, Y., Goyal, N., Ghazvininejad, M., Mohamed, A., Levy, O., ... & Zettlemoyer, L. (2019). Bart: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. arXiv preprint arXiv:1910.13461.

RoBERTa

Liu, Y., Ott, M., Goyal, N., Du, J., Joshi, M., Chen, D., ... & Stoyanov, V. (2019). Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692.

CLIP

Radford, A., Kim, J. W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., Sastry, G., Askell, A., Mishkin, P., Clark, J., Krueger, G., & Sutskever, I. (2021). Learning Transferable Visual Models From Natural Language Supervision. arXiv preprint arXiv:2103.00020

Comparison of Methods

Yin, W., Hay, J., & Roth, D. (2019). Benchmarking zero-shot text classification: Datasets, evaluation and entailment approach. arXiv preprint arXiv:1909.00161.

Owner

Name: Aleksandar Tomašević
Login: atomashevic
Kind: user
Location: Novi Sad, Serbia
Company: University of Novi Sad

Website: www.atomasevic.com
Twitter: atomasevic
Repositories: 2
Profile: https://github.com/atomashevic

Citation (citation.cff)

cff-version: 1.2.0
message: "To cite transforEmotion in publications use:"
authors:
- family-names: "Christensen"
  given-names: "ALexander P"
  orcid: "https://orcid.org/0000-0002-5601-7982"
- family-names: "Golino"
  given-names: "Hudson"
  orcid: "https://orcid.org/0000-0002-1601-1447"
- family-names: "Tomašević"
  given-names: "Aleksandar  "
  orcid: "https://orcid.org/0000-0003-4863-6051"
title: "transforEmotion: Sentiment Analysis for Text, Image and Video using Transformer models"
version: 0.1.4
doi: 10.5281/zenodo.10471355
date-released: 2024
url: "https://github.com/atomashevic/transforEmotion"

GitHub Events

Total

Create event: 4
Release event: 1
Issues event: 7
Watch event: 9
Delete event: 1
Issue comment event: 7
Push event: 29
Pull request event: 1
Fork event: 2

Last Year

Create event: 4
Release event: 1
Issues event: 7
Watch event: 9
Delete event: 1
Issue comment event: 7
Push event: 29
Pull request event: 1
Fork event: 2

Committers

Last synced: 12 months ago

All Time

Total Commits: 163
Total Committers: 3
Avg Commits per committer: 54.333
Development Distribution Score (DDS): 0.46

Past Year

Commits: 47
Committers: 2
Avg Commits per committer: 23.5
Development Distribution Score (DDS): 0.043

Top Committers

Name	Email	Commits
Aleksandar Tomašević	a**c@g**m	88
Alexander Christensen	a**n@g**m	72
hfgolino	h**o@g**m	3

Issues and Pull Requests

Last synced: 6 months ago

All Time

Total issues: 18
Total pull requests: 7
Average time to close issues: 3 months
Average time to close pull requests: 11 minutes
Total issue authors: 8
Total pull request authors: 2
Average comments per issue: 2.22
Average comments per pull request: 0.14
Merged pull requests: 7
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 3
Pull requests: 1
Average time to close issues: 3 months
Average time to close pull requests: 39 minutes
Issue authors: 2
Pull request authors: 1
Average comments per issue: 0.0
Average comments per pull request: 0.0
Merged pull requests: 1
Bot issues: 0
Bot pull requests: 0

View more stats

Top Authors

Issue Authors

atomashevic (7)
AlexChristensen (3)
myrainbowandsky (2)
rodtaq2015 (1)
csmsm (1)
ibernetti63 (1)
Sdbock (1)
SoaresAlisson (1)

Pull Request Authors

atomashevic (8)
hfgolino (1)

Top Labels

Issue Labels

enhancement (7) bug (5) wontfix (2) good first issue (1)

Pull Request Labels

Dependencies

DESCRIPTION cran

R >= 3.5.0 depends
LSAfun * imports
dplyr * imports
googledrive * imports
pbapply * imports
remotes * imports
reticulate * imports
knitr * suggests
markdown * suggests
rmarkdown * suggests
rstudioapi * suggests

.github/workflows/r.yml actions

actions/checkout v2 composite
r-lib/actions/check-r-package v2 composite
r-lib/actions/setup-pandoc v2 composite
r-lib/actions/setup-r v2 composite
r-lib/actions/setup-r-dependencies v2 composite
r-lib/actions/setup-tinytex v2 composite

transforemotion

Science Score: 67.0%

Repository

Basic Info

Statistics

Metadata Files

README.md

CRAN 0.1.6 | GitHub 0.1.6

transforEmotion: Sentiment Analysis for Text, Image and Video Using Transformer Models

How to Install

Load package

Run Python setup

Text Example

Load data

Example text

Cross-Encoder DistilRoBERTa

BART

Facebook BART Large

Any Text Classification Model with a Pipeline on huggingface

Directly from huggingface: typeform/distilbert-base-uncased-mnli

RAG

Image Example

Image URL or local filepath

Array of emotion labels

Run FER with base model

Video Example

Video URL or local filepath

Array of emotion labels

Run FER on nframes of the video with large model

Available Models

GPU Support

References

BART

RoBERTa

CLIP

Comparison of Methods

Owner

Citation (citation.cff)

GitHub Events

Total

Last Year

Committers

All Time

Past Year

Top Committers

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels

Dependencies

Run FER on `nframes` of the video with large model