Logodetect
Logodetect: One-shot detection of logos in image and video data - Published in JOSS (2022)
Science Score: 93.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ○ CITATION.cff file
- ✓ codemeta.json file: found
- ✓ .zenodo.json file: found
- ✓ DOI references: found 4 DOI reference(s) in README and JOSS metadata
- ✓ Academic publication links: arxiv.org, joss.theoj.org
- ○ Committers with academic emails
- ○ Institutional organization owner
- ✓ JOSS paper metadata: published in Journal of Open Source Software
Repository
Find logos in images and videos in just one shot. Never be embarrassed again to say that you have a small data situation!
Basic Info
- Host: GitHub
- Owner: Heldenkombinat
- License: agpl-3.0
- Language: Jupyter Notebook
- Default Branch: master
- Homepage: https://logodetect.netlify.com/
- Size: 19.1 MB
Statistics
- Stars: 67
- Watchers: 5
- Forks: 10
- Open Issues: 1
- Releases: 2
Metadata Files
README.md
Never be embarrassed again to say you have no big data! logodetect is a one-shot
detection library to find logos of any kind in video and image data.
Here's a quick example of football footage in which all logos on jerseys and the sports field are detected. Check out our demo if you want to see it in action right away.
Introduction
There is plenty of literature on the use of deep learning for detecting logos. So, in addition to sharing a couple of algorithms to get you started with one-shot logo detection, the aim of this project is to provide a flexible architecture that facilitates the comparison of different algorithms for one-shot object recognition.
The pipeline supports one or two stages: you can either perform object recognition alone, or first perform object detection and then object recognition in a second stage.
The idea is that you can use a generic detector for a single class of objects (e.g. logos, traffic signs or faces) and then compare each of its detections with the exemplar, i.e., the sub-class that you are trying to recognize, to determine if both belong to the same sub-class (e.g. a specific brand, a stop sign or the face of a loved one). To get you started, we include two algorithms that you can play with. Both use a Faster-RCNN [1] in the first stage for object detection; they differ in the second stage, which performs object recognition.
As a baseline, we bring the exemplars and the detections from the first stage into the same latent space
(this reduces the curse of dimensionality) and then simply measure the Euclidean or the cosine distance
between both embeddings for object recognition. Both inputs are considered to belong to the same sub-class
if their distance is below a threshold determined in a preliminary analysis of the training dataset.
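To make the baseline concrete, here is a minimal sketch of this matching step; the embedder network, the threshold value, and the function name are illustrative stand-ins, not logodetect's actual API:
```python
# Minimal sketch of the baseline matching step (illustrative names only).
import torch
import torch.nn.functional as F

def same_subclass(embedder, exemplar, detection, threshold=0.5, metric="cosine"):
    """Embed both crops and compare them in the shared latent space."""
    with torch.no_grad():
        z_exemplar = embedder(exemplar.unsqueeze(0))    # shape: (1, d)
        z_detection = embedder(detection.unsqueeze(0))  # shape: (1, d)
    if metric == "cosine":
        # Cosine distance = 1 - cosine similarity.
        distance = 1.0 - F.cosine_similarity(z_exemplar, z_detection).item()
    else:
        # Euclidean distance between the two embeddings.
        distance = torch.dist(z_exemplar, z_detection, p=2).item()
    # Same sub-class if the embeddings are closer than the threshold.
    return distance < threshold
```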
The code also provides functionality to apply various transformations, so you have the option to augment
each exemplar with different transformations if you want. Simply add one or more exemplars to
the data/exemplars folder that is generated after you've followed the installation instructions below,
and you are good to go.
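As a rough illustration, such augmentation could look like this with imgaug (one of the project's dependencies); the chosen transformations and file names below are made-up examples, not what logodetect ships:
```python
# Illustrative exemplar augmentation with imgaug; paths and transforms
# are examples only.
import imageio
import imgaug.augmenters as iaa

exemplar = imageio.imread("data/exemplars/my_brand.png")

augmenter = iaa.Sequential([
    iaa.Affine(rotate=(-15, 15), scale=(0.8, 1.2)),  # small rotations and zooms
    iaa.Multiply((0.8, 1.2)),                        # brightness variation
])

# Write a handful of augmented variants next to the original exemplar.
for i in range(5):
    imageio.imwrite(f"data/exemplars/my_brand_aug_{i}.png",
                    augmenter(image=exemplar))
```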
As a first reference against this baseline, we also provide a modified ResNet [2] for object recognition that directly takes the exemplars and the detections from the first stage and predicts whether both belong to the same sub-class. Similar to [3], this network infers a distance metric after being trained with examples of different sub-classes, but instead of sharing the same weights and processing each input in a separate pass as in [4], it concatenates both inputs and processes them in one pass. This concept more closely follows the architecture proposed in [5], where the assumption is that the exemplars often have more high-frequency components than the detections, so the model can increase its accuracy by learning a separate set of weights for each input. However, our proposed architecture splits the detection and classification tasks into two separate stages, which allows different classifiers to be used, and compared, in the second stage.
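For illustration, here is a minimal sketch of the concatenation idea, assuming a torchvision ResNet-18 with a widened input layer; the shipped model's exact architecture, input size, and training setup may differ:
```python
# Sketch: a ResNet-18 whose first convolution accepts 6 channels
# (exemplar and detection stacked), with a binary "same sub-class?" head.
import torch
import torch.nn as nn
from torchvision.models import resnet18

def make_pair_classifier():
    model = resnet18(num_classes=2)  # two outputs: same / different sub-class
    # Widen the 3-channel input layer to 6 channels so the exemplar and
    # the detection can be processed together in a single pass.
    model.conv1 = nn.Conv2d(6, 64, kernel_size=7, stride=2, padding=3, bias=False)
    return model

model = make_pair_classifier()
exemplar = torch.randn(1, 3, 224, 224)          # dummy exemplar crop
detection = torch.randn(1, 3, 224, 224)         # dummy detection crop
pair = torch.cat([exemplar, detection], dim=1)  # concatenate along channels
logits = model(pair)                            # shape: (1, 2)
```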
The models that we include in the repo achieved reasonable performance after a few training epochs.
However, if you would like to improve their performance, you can find pointers to various datasets in [6],
which can be used in the training part of this project.
References
[1] Ren et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks (2016)
[2] He et al. Deep Residual Learning for Image Recognition (2016)
[3] Hsieh et al. One-Shot Object Detection with Co-Attention and Co-Excitation (2019)
[4] Koch et al. Siamese Neural Networks for One-shot Image Recognition (2015)
[5] Bhunia et al. A Deep One-Shot Network for Query-based Logo Retrieval (2019)
[6] Hoi et al. LOGO-Net: Large-scale Deep Logo Detection and Brand Recognition with Deep Region-based Convolutional Networks (2015)
Installation
This library is intended for Linux-based operating systems, such as Ubuntu, and currently does not run on Windows.
It depends on libgl1, which you can install with apt-get install libgl1.
logodetect is available on PyPI, so you can simply run pip install logodetect to install it.
Make sure you have Python 3.7 or later installed and an up-to-date pip (run pip install -U pip).
Also, we recommend working with virtual environments, but that is ultimately up to you.
If you want to build logodetect from source, run
```bash
git clone git@github.com:Heldenkombinat/logodetect.git
cd logodetect
pip install -e ".[tests, dev]"
```
Depending on your system and setup, you might have to run the install command with sudo.
Usage
After successful installation, a CLI tool called logodetect becomes available to you. If you invoke logodetect
without any arguments, you will get help on how to use it. To automatically download all models and data needed
to test the application, first run the following command in your clone of this repository:
```bash
export LOGOS_RECOGNITION=$(pwd)
logodetect init
```
which will download all files to the current working directory. If you prefer to download the data to another
folder, set the environment variable LOGOS_RECOGNITION accordingly, and consider putting this variable in your
.bashrc, .zshrc or an equivalent configuration file on your system. If you don't set the variable at all, it
defaults to ~/.hkt/logodetect.
After running the logodetect init CLI, you'll find data and models relative to the specified folder in the following
structure:
```text
data/
    exemplars/
    exemplars_100x100/
    exemplars_100x100_aug/
    exemplars_hq/
    test_images/
    test_videos/
models/
    classifier_resnet18.pth
    detector.pth
    embedder.pth
```
If you're interested in training your own algorithms, it's a good idea to have a look at how the exemplar data is
structured. For more on training, see the training folder and its readme.
The logodetect CLI tool comes with two main commands, namely video and image, both of which work
fairly similarly. In each case you need to provide the input data in which you would like to detect logos,
and the logo exemplars that you want to detect in the footage. To get you started, we've provided
some demo data that you can use out of the box. That means you can simply run:
```bash
logodetect video
```
to run one-shot detection on an example video, which should output the following text:
```text
Rendering video: 100%|██████████| 17/17 [00:00<00:00, 707.42it/s]
Moviepy - Building video /path/data/test_videos/test_video_small_50ms_output.mp4.
Moviepy - Writing video /path/data/test_videos/test_video_small_50ms_output.mp4
Moviepy - Done !
Moviepy - video ready /path/data/test_videos/test_video_small_50ms_output.mp4
All done! ✨ 🍰 ✨
```
Alternatively, you can run
```bash
logodetect image
```
to do so for an example image, which results in the following output:
```text
Saved resulting image as /path/data/test_images/test_image_small_output.png.
All done! ✨ 🍰 ✨
```
If you want to use another video, you can do so with the -v option. Images can be provided
with the -i option, and custom exemplars are configured with the -e option. If you want to run logodetect with your
own custom configuration, provide a JSON file (like the config.json in this repo) with the -c option.
That means, if you want to run detection on custom video data with custom exemplars, you would use
```bash
logodetect video -v <path-to-video> -e <path-to-exemplars-folder> -c <path-to-custom-config-json>
```
Minimal web application for image recognition
To run a small web app locally in your browser for uploading images to recognize, simply run
```bash
python app.py
```
and navigate to http://localhost:5000 in the browser of your choice. We've also hosted an online
demo for you here.
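For orientation, here is a generic sketch of what such a minimal Flask upload service can look like; this is not the repo's actual app.py, and recognize_image below is a hypothetical stand-in for the real recognition call:
```python
# Generic sketch of a minimal Flask image-upload service on port 5000.
from flask import Flask, request

app = Flask(__name__)

def recognize_image(path):
    # Placeholder: wire up the recognition pipeline here.
    return []

@app.route("/predict", methods=["POST"])
def predict():
    uploaded = request.files["image"]   # image sent as multipart form data
    path = "/tmp/upload.png"
    uploaded.save(path)                 # persist the upload for the recognizer
    return {"detections": recognize_image(path)}

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=5000)
```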
On top of that, the aws folder explains in detail how to host this application yourself on Amazon Web
Services. This minimalistic application can of course be extended to your own needs at any point.
Full set of CLI commands and help pages
In the last section we have already discussed the three commands exposed to users through the logodetect
CLI tool, namely init, image, and video. While init does not take any parameters, the other two
need a bit more explanation. Below you find the complete API reference from the respective help pages
of our CLI.
Images
```bash
logodetect image --help
```
```text
Usage: logodetect image [OPTIONS]

Options:
  -i, --image_filename TEXT   path to your input image
  -c, --config_file TEXT      path to a file containing a logodetect config JSON
  -o, --output_appendix TEXT  string appended to your resulting file
  -e, --exemplars TEXT        path to your exemplars folder
  --help                      Show this message and exit.
```
Videos
```bash
logodetect video --help
```
```text
Usage: logodetect video [OPTIONS]

Options:
  -v, --video_filename TEXT   path to your input video
  -c, --config_file TEXT      path to a file containing a logodetect config JSON
  -o, --output_appendix TEXT  string appended to your resulting file
  -e, --exemplars TEXT        path to your exemplars folder
  --help                      Show this message and exit.
```
Core abstractions
logodetect works with a two-phased approach. In the first phase, objects are detected with
a Detector; in the second, they are compared to the exemplars and identified with a Classifier.
Both phases are integrated into the inference pipeline by a single Recognizer,
which detects potential overlay boxes in video frames or images and then labels the detected
boxes according to their classification.
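Conceptually, the flow looks roughly like this; the class structure follows the abstractions above, but the method names and signatures are illustrative, not the exact logodetect API:
```python
# Conceptual sketch of the two-phase Recognizer flow (illustrative API).
class Recognizer:
    def __init__(self, detector, classifier, exemplars):
        self.detector = detector      # phase 1: finds candidate logo boxes
        self.classifier = classifier  # phase 2: matches boxes to exemplars
        self.exemplars = exemplars

    def predict(self, image):
        labeled = []
        for box in self.detector.detect(image):  # class-agnostic detection
            crop = image[box.y0:box.y1, box.x0:box.x1]
            # Compare each detection against the exemplars to decide
            # whether it shows the sub-class we are looking for.
            label = self.classifier.classify(crop, self.exemplars)
            if label is not None:
                labeled.append((box, label))
        return labeled  # boxes plus labels, ready to overlay on the frame
```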
Configuration
The specific parameter settings of the algorithms used in logodetect, i.e. the options for all of our detectors,
classifiers, data augmenters, and the system devices used, can be changed by providing a config.json file via the
-c flag of the main CLI commands explained above. The example config.json file in this repo explains which options
you have and what exactly you can modify in logodetect.
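For example, one way to derive a custom configuration is to copy the shipped file and tweak it before passing it with -c; note that the "device" key below is a hypothetical option name, so check the example config.json for the real ones:
```python
# Copy the shipped config, tweak a setting, and save a custom variant.
import json

with open("config.json") as f:
    config = json.load(f)

config["device"] = "cpu"  # hypothetical override, for illustration only

with open("my_config.json", "w") as f:
    json.dump(config, f, indent=2)

# Then run, e.g.: logodetect image -i <path-to-image> -c my_config.json
```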
Notebooks
You can find example Jupyter notebooks from the logodetect project in the notebooks/ folder. If
you're interested in training new models, training/notebooks/ might interest you as well.
Docker support
If you prefer to work with Docker, build an image and run it like this:
```bash
docker build . -t logodetect
docker run -e LOGOS_RECOGNITION=/app -p 5000:5000 -t logodetect
```
Important: this assumes that you have previously downloaded all data and models right next to
the Dockerfile in the local copy of this repo.
Automatic code linting with black
This project uses black for code linting. To install the git pre-commit hook for black,
simply run
```bash
pre-commit install
```
from the base of this repository. This will run black each time you make a commit (and fail the commit in
case of grave errors). Once CI is up for this project, we will ensure this hook runs on each CI pass.
To manually run black on a file, use black <path-to-file>.
Running tests
Run all tests with pytest, or just run the quicker unit test suite with
```bash
pytest -m unit
```
or all longer-running integration tests with
```bash
pytest -m integration
```
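These suites are selected via pytest markers. A minimal example of how a test opts into each suite; the test names and bodies below are hypothetical:
```python
# Markers behind `pytest -m unit` and `pytest -m integration`.
import pytest

@pytest.mark.unit
def test_threshold_logic():
    # Fast, isolated check with no model or data downloads involved.
    assert 0.3 < 0.5

@pytest.mark.integration
def test_end_to_end_pipeline():
    # Longer-running check that would exercise the full pipeline.
    pass
```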
Building the paper locally
To build the JOSS paper locally with Docker, run:
```bash
docker run --rm \
    --volume $PWD:/data \
    --user $(id -u):$(id -g) \
    --env JOURNAL=joss \
    openjournals/paperdraft
```
Support
For support, issues and contributions, please follow the guidelines in SUPPORT.md.
Owner
- Name: Heldenkombinat Technologies GmbH
- Login: Heldenkombinat
- Kind: organization
- Website: https://www.heldenkombinat.com/
- Repositories: 1
- Profile: https://github.com/Heldenkombinat
We build AI systems that accelerate productivity and discover new strategies.
JOSS Publication
Logodetect: One-shot detection of logos in image and video data
Authors
Heldenkombinat Technologies GmbH
IUBH Internationale Hochschule, Pathmind Inc.
Tags
object detection, object recognition, computer vision, image processing, video processing, one-shot learning
CodeMeta (codemeta.json)
{
"@context": "https://raw.githubusercontent.com/codemeta/codemeta/master/codemeta.jsonld",
"@type": "Code",
"author": [
{
"@id": "https://orcid.org/0000-0002-7801-4184",
"@type": "Person",
"email": "max.pumperla@googlemail.com",
"name": "Max Pumperla",
"affiliation": ""
},
{
"@id": "",
"@type": "Person",
"email": "",
"name": "Jorge Davila-Chacon",
"affiliation": ""
}
],
"identifier": "",
"codeRepository": "https://github.com/Heldenkombinat/Logodetect",
"datePublished": "2021-02-05",
"dateModified": "2021-02-05",
"dateCreated": "2021-02-05",
"description": "Logodetect: One-shot detection of logos in image and video data",
"keywords": "image-processing, video-processing, object-detection, one-shot-learning, small-dataset, logo-detection",
"license": "AGPLv3",
"title": "logodetect",
"version": "v0.1"
}
GitHub Events
Total
- Watch event: 6
- Fork event: 1
Last Year
- Watch event: 6
- Fork event: 1
Committers
Last synced: 5 months ago
Top Committers
| Name | Email | Commits |
|---|---|---|
| Max Pumperla | m****a@g****m | 97 |
| Jorge | j****h@g****m | 79 |
| dependabot[bot] | 4****] | 1 |
| Arfon Smith | a****n | 1 |
Issues and Pull Requests
Last synced: 4 months ago
All Time
- Total issues: 20
- Total pull requests: 21
- Average time to close issues: about 1 month
- Average time to close pull requests: about 15 hours
- Total issue authors: 7
- Total pull request authors: 4
- Average comments per issue: 0.85
- Average comments per pull request: 0.0
- Merged pull requests: 20
- Bot issues: 0
- Bot pull requests: 2
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- maxpumperla (13)
- szkafander (2)
- xapu3ma (1)
- OrjwanZaafarani (1)
- alantaitz (1)
- nsaunier (1)
- theCuriousHAT (1)
Pull Request Authors
- jorgedch (12)
- maxpumperla (6)
- dependabot[bot] (3)
- arfon (1)
Packages
- Total packages: 1
- Total downloads: 47 last month (pypi)
- Total dependent packages: 0
- Total dependent repositories: 1
- Total versions: 7
- Total maintainers: 2
pypi.org: logodetect
One-shot logo detection for videos and images.
- Homepage: https://github.com/Heldenkombinat/logodetect
- Documentation: https://logodetect.readthedocs.io/
- License: GNU AGPLv3
- Latest release: 1.1.3 (published over 3 years ago)
Dependencies
- Cython >=0.29.15
- Flask >=1.1.1
- Flask-Cors >=3.0.9
- autopep8 *
- click >=7.1.1
- gunicorn >=20.0.4
- imgaug >=0.4.0
- jupyterlab ==3.2.4
- matplotlib >=2.2.5
- moviepy >=1.0.1
- numpy >=1.18.2
- opencv-python >=4.2.0.32
- pandas >=1.0.3
- pylint *
- pytest >=6.2.4
- scikit-image >=0.16.2
- scikit-learn >=0.22.1
- scipy >=1.4.1
- torch >=1.9.0
- torchvision >=0.10.0
- tqdm >=4.42.1
- Cython >=0.29.15
- Flask >=1.1.1
- Flask-Cors >=3.0.9
- click >=7.1.1
- gunicorn >=20.0.4
- imgaug >=0.4.0
- jupyterlab ==3.2.4
- matplotlib >=2.2.5
- moviepy >=1.0.1
- numpy >=1.18.2
- opencv-python >=4.2.0.32
- pandas >=1.0.3
- pytest >=6.2.4
- scikit-image >=0.16.2
- scikit-learn >=0.22.1
- scipy >=1.4.1
- torch >=1.9.0
- torchvision >=0.10.0
- tqdm >=4.42.1
- Cython *
- Pillow *
- autopep8 *
- ipykernel *
- matplotlib *
- moviepy *
- numpy *
- opencv-python *
- pandas *
- pycocotools *
- pycodestyle *
- pylint *
- scipy *
- sk-video *
- tabulate *
- tensorboard *
- torch ==1.5.0
- torchvision ==0.6.0
- tqdm *
- actions/checkout v2 composite
- actions/upload-artifact v1 composite
- openjournals/openjournals-draft-action master composite
- pytorch/pytorch latest build
