https://github.com/ammar257ammar/tika-dockers
A suite of Machine Learning / Deep Learning Dockerfiles to allow Apache Tika to extract objects and to produce textual captions for images and video
Fork of USCDataScience/tika-dockers
Created almost 5 years ago
· Last pushed over 5 years ago
# Tika Image and Video Object Detection and Captioning Dockers
This repository contains Dockerfile sources for object detection and captioning of images and video using [TensorFlow](http://tensorflow.org/) and the [Apache Tika](http://tika.apache.org/) toolkit. Combined with our [Image Captioning Project by USC Data Science](http://github.com/USCDataScience/img2text.git), it provides a full implementation of the [Show and Tell: A Neural Image Caption Generator](https://arxiv.org/abs/1411.4555) paper. To our knowledge, no other implementation of this paper is publicly available.
# To Build the Dockers
1. `docker build -f InceptionRestDockerfile -t uscdatascience/inception-rest-tika .` - Builds the image for Inception v4 object detection in images.
2. `docker build -f Im2txtRestDockerfile -t uscdatascience/im2txt-rest-tika .` - Builds the image for Show and Tell image captioning.
3. `docker build -f InceptionVideoRestDockerfile -t uscdatascience/inception-video-rest-tika .` - Builds an image with OpenCV and TensorFlow that can be used to identify objects in videos.
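The three builds above can be run in one pass. The following is a minimal sketch, not part of the repository: the `build_all` function and the `DRY_RUN` switch are hypothetical names introduced here, and the script assumes it is run from the repository root where the three Dockerfiles live. By default it only prints the commands it would run.

```shell
#!/bin/sh
# Sketch: build all three images in one pass.
# DRY_RUN and build_all are hypothetical names, not part of the repository.
# DRY_RUN=1 (default) prints the commands; DRY_RUN=0 executes them.
DRY_RUN="${DRY_RUN:-1}"

build_all() {
  # Each entry is "<Dockerfile>:<image tag>", copied from the build steps.
  for pair in \
    "InceptionRestDockerfile:uscdatascience/inception-rest-tika" \
    "Im2txtRestDockerfile:uscdatascience/im2txt-rest-tika" \
    "InceptionVideoRestDockerfile:uscdatascience/inception-video-rest-tika"
  do
    dockerfile="${pair%%:*}"   # text before the first colon
    tag="${pair#*:}"           # text after the first colon
    if [ "$DRY_RUN" = "1" ]; then
      echo "docker build -f $dockerfile -t $tag ."
    else
      # Stop at the first failing build.
      docker build -f "$dockerfile" -t "$tag" . || return 1
    fi
  done
}

build_all
```

Saved as, say, `build_all.sh`, the script would be run with `DRY_RUN=0 sh build_all.sh` to perform the actual builds.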
# To Test the Dockers
1. `docker run -it -p 8764:8764 uscdatascience/inception-rest-tika` - then run the tests in the [ObjectRecognitionParserTest class](https://github.com/apache/tika/blob/master/tika-parsers/src/test/java/org/apache/tika/parser/recognition/ObjectRecognitionParserTest.java).
2. `docker run -it -p 8764:8764 uscdatascience/im2txt-rest-tika` - then run the tests in the [ObjectRecognitionParserTest class](https://github.com/apache/tika/blob/master/tika-parsers/src/test/java/org/apache/tika/parser/recognition/ObjectRecognitionParserTest.java).
3. `docker run -it -p 8764:8764 uscdatascience/inception-video-rest-tika` - then run the tests in the [TensorflowVideoRecParserTest class](https://github.com/apache/tika/blob/master/tika-parsers/src/test/java/org/apache/tika/parser/recognition/tf/TensorflowVideoRecParserTest.java).
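All three commands above publish the same host port, 8764, so only one of these containers can be running at a time. The sketch below starts a single image and prints which Tika test class goes with it. The names `start_service`, `test_class_for`, and `DRY_RUN` are hypothetical and not part of the repository, and the container is started detached (`-d --rm`) rather than with `-it` as in the steps above, purely for convenience in a script.

```shell
#!/bin/sh
# Sketch: start one image at a time, since all three bind host port 8764.
# start_service, test_class_for and DRY_RUN are hypothetical names.
# DRY_RUN=1 (default) prints the command; DRY_RUN=0 executes it.
DRY_RUN="${DRY_RUN:-1}"

# Map an image name to the Tika test class named in the test steps.
test_class_for() {
  case "$1" in
    uscdatascience/inception-rest-tika)       echo "ObjectRecognitionParserTest" ;;
    uscdatascience/im2txt-rest-tika)          echo "ObjectRecognitionParserTest" ;;
    uscdatascience/inception-video-rest-tika) echo "TensorflowVideoRecParserTest" ;;
    *) echo "unknown image: $1" >&2; return 1 ;;
  esac
}

start_service() {
  image="$1"
  test_class="$(test_class_for "$image")" || return 1
  # Detached and auto-removed on stop; the host port matches the steps above.
  cmd="docker run -d --rm -p 8764:8764 $image"
  if [ "$DRY_RUN" = "1" ]; then
    echo "$cmd"
  else
    $cmd || return 1
  fi
  echo "next: run $test_class from a Tika source checkout"
}

start_service "${1:-uscdatascience/inception-rest-tika}"
```

Stop the running container with `docker stop` before starting the next image; otherwise the second `docker run` fails because the port is already allocated.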
# Citation
If you use this work, please first cite [USC Data Science](https://github.com/USCDataScience/tika-dockers) and then:
```
@article{DBLP:journals/corr/VinyalsTBE14,
author = {Oriol Vinyals and
Alexander Toshev and
Samy Bengio and
Dumitru Erhan},
title = {Show and Tell: {A} Neural Image Caption Generator},
journal = {CoRR},
volume = {abs/1411.4555},
year = {2014},
url = {http://arxiv.org/abs/1411.4555},
archivePrefix = {arXiv},
eprint = {1411.4555},
timestamp = {Wed, 07 Jun 2017 14:41:10 +0200},
biburl = {http://dblp.org/rec/bib/journals/corr/VinyalsTBE14},
bibsource = {dblp computer science bibliography, http://dblp.org}
}
```
See Also
========
See the [Tika Advanced Object Recognition Page](https://wiki.apache.org/tika/FrontPage#Object_Recognition_.28Computer_Vision.29_support) for more advanced tutorials on using the Dockers with Tika.
Questions, comments?
===================
Send them to [Chris A. Mattmann](mailto:chris.a.mattmann@jpl.nasa.gov) or [Thejan Wijesinghe](mailto:thejan.k.wijesinghe@gmail.com).
Contributors
============
* Chris A. Mattmann, JPL & USC
* Thejan Wijesinghe, University of Moratuwa
License
===
This project is licensed under the [Apache License, version 2.0](http://www.apache.org/licenses/LICENSE-2.0).
Owner
- Name: Ammar Ammar
- Login: ammar257ammar
- Kind: user
- Location: The Netherlands
- Company: Maastricht University
- Repositories: 14
- Profile: https://github.com/ammar257ammar