tesseract-ocr
Tesseract Open Source OCR Engine (main repository)
Science Score: 54.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ✓ CITATION.cff file: Found CITATION.cff file
- ✓ codemeta.json file: Found codemeta.json file
- ✓ .zenodo.json file: Found .zenodo.json file
- ○ DOI references
- ○ Academic publication links
- ✓ Committers with academic emails: 4 of 221 committers (1.8%) from academic institutions
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: Low similarity (15.1%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: tesseract-ocr
- License: apache-2.0
- Language: C++
- Default Branch: main
- Homepage: https://tesseract-ocr.github.io/
- Size: 51.2 MB
Statistics
- Stars: 68,733
- Watchers: 1,702
- Forks: 10,113
- Open Issues: 452
- Releases: 34
Metadata Files
README.md
Tesseract OCR
About
This package contains an OCR engine, libtesseract, and a command line program, tesseract.
Tesseract 4 adds a new neural net (LSTM) based OCR engine which is focused on line recognition, but also still supports the legacy Tesseract OCR engine of Tesseract 3 which works by recognizing character patterns. Compatibility with Tesseract 3 is enabled by using the Legacy OCR Engine mode (--oem 0). It also needs traineddata files which support the legacy engine, for example those from the tessdata repository.
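As an illustration of the engine modes mentioned above, here is a minimal Python sketch. Only `--oem 0` (legacy engine) is stated in this README; the other values and the enum names are taken from memory of Tesseract's `OcrEngineMode` enum and should be checked against `tesseract --help-oem`. The helper `oem_args` is hypothetical and not part of Tesseract.

```python
from enum import IntEnum


class OcrEngineMode(IntEnum):
    """Values for --oem (assumed; verify with `tesseract --help-oem`)."""
    TESSERACT_ONLY = 0            # legacy engine only (needs legacy-capable traineddata)
    LSTM_ONLY = 1                 # neural net (LSTM) engine only
    TESSERACT_LSTM_COMBINED = 2   # legacy and LSTM engines together
    DEFAULT = 3                   # choose based on what is available


def oem_args(mode):
    """Return the command line fragment that selects an engine mode (hypothetical helper)."""
    # OcrEngineMode(mode) raises ValueError for values outside 0..3.
    return ["--oem", str(int(OcrEngineMode(mode)))]
```

For example, `oem_args(OcrEngineMode.TESSERACT_ONLY)` yields the `--oem 0` fragment used for Tesseract 3 compatibility.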
Stefan Weil is the current lead developer. Ray Smith was the lead developer until 2018. The maintainer is Zdenko Podobny. For a list of contributors see AUTHORS and GitHub's log of contributors.
Tesseract has Unicode (UTF-8) support, and can recognize more than 100 languages "out of the box".
Tesseract supports various image formats including PNG, JPEG and TIFF.
Tesseract supports various output formats: plain text, hOCR (HTML), PDF, invisible-text-only PDF, TSV, ALTO and PAGE.
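On the command line, an output format is usually chosen by naming a config file after the output base. The Python sketch below maps the formats listed above to config names. All config names, and the `textonly_pdf` variable, are assumptions about what a stock Tesseract 5 install ships in `tessdata/configs`; `output_args` is a hypothetical helper. Check your installation before relying on them.

```python
# Output formats named in this README, mapped to the trailing command line
# arguments that are assumed to select them (config file names are assumptions).
OUTPUT_CONFIGS = {
    "plain text": ["txt"],
    "hOCR": ["hocr"],
    "PDF": ["pdf"],
    "invisible-text-only PDF": ["-c", "textonly_pdf=1", "pdf"],
    "TSV": ["tsv"],
    "ALTO": ["alto"],
    "PAGE": ["page"],
}


def output_args(fmt):
    """Return a copy of the trailing arguments for one output format (hypothetical helper)."""
    try:
        return list(OUTPUT_CONFIGS[fmt])
    except KeyError:
        raise ValueError(f"unknown output format: {fmt!r}") from None
```

A call such as `output_args("hOCR")` gives the arguments to append after `imagename outputbase`.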
In many cases, you will need to improve the quality of the input image to get better OCR results.
This project does not include a GUI application. If you need one, please see the 3rdParty documentation.
Tesseract can be trained to recognize other languages. See Tesseract Training for more information.
Brief history
Tesseract was originally developed at Hewlett-Packard Laboratories Bristol UK and at Hewlett-Packard Co, Greeley Colorado USA between 1985 and 1994, with some more changes made in 1996 to port to Windows, and some C++izing in 1998. In 2005 Tesseract was open sourced by HP. From 2006 until November 2018 it was developed by Google.
Major version 5 is the current stable version and started with release 5.0.0 on November 30, 2021. Newer minor versions and bugfix versions are available from GitHub.
The latest source code is available from the main branch on GitHub. Open issues can be found in the issue tracker and the planning documentation.
See Release Notes and Change Log for more details of the releases.
Installing Tesseract
You can either install Tesseract via a pre-built binary package or build it from source.
Before building Tesseract from source, check that your system has one of the supported compilers.
Running Tesseract
Basic command line usage:
tesseract imagename outputbase [-l lang] [--oem ocrenginemode] [--psm pagesegmode] [configfiles...]
For more information about the various command line options use tesseract --help or man tesseract.
Examples can be found in the documentation.
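The synopsis above can also be assembled programmatically. This is a minimal Python sketch, not an official wrapper: the function name `build_tesseract_command` and the sample file names are hypothetical, and it only builds the argument list in the order shown in the synopsis.

```python
def build_tesseract_command(image, outputbase, lang=None, oem=None, psm=None, configs=()):
    """Build an argv list following the synopsis
    tesseract imagename outputbase [-l lang] [--oem N] [--psm N] [configfiles...]
    (hypothetical helper, not part of Tesseract)."""
    cmd = ["tesseract", str(image), str(outputbase)]
    if lang is not None:
        cmd += ["-l", lang]
    if oem is not None:
        cmd += ["--oem", str(oem)]
    if psm is not None:
        cmd += ["--psm", str(psm)]
    # Config files always come last.
    cmd += list(configs)
    return cmd
```

The resulting list can be passed to `subprocess.run(cmd, check=True)` on a system where the `tesseract` binary is on the `PATH`; for instance, `build_tesseract_command("scan.png", "out", lang="eng")` corresponds to `tesseract scan.png out -l eng`.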
For developers
Developers can use the libtesseract C or C++ API to build their own applications. If you need bindings to libtesseract for other programming languages, please see the wrapper section in the AddOns documentation.
Documentation of Tesseract generated from source code by doxygen can be found on tesseract-ocr.github.io.
Support
Before you submit an issue, please review the guidelines for this repository.
For support, first read the documentation, particularly the FAQ to see if your problem is addressed there. If not, search the Tesseract user forum, the Tesseract developer forum and past issues, and if you still can't find what you need, ask for support in the mailing-lists.
Mailing-lists:
- tesseract-ocr - For tesseract users.
- tesseract-dev - For tesseract developers.
Please report an issue only for a bug, not for asking questions.
License
The code in this repository is licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
NOTE: This software depends on other packages that may be licensed under different open source licenses.
Tesseract uses the Leptonica library, which is essentially under a BSD 2-clause license.
Dependencies
Tesseract uses the Leptonica library to open input images (that is, image files, not documents such as PDF). Building Leptonica with built-in support for zlib, PNG and TIFF (for multipage TIFF) is recommended.
Latest Version of README
For the latest online version of the README.md see:
https://github.com/tesseract-ocr/tesseract/blob/main/README.md
Owner
- Name: tesseract-ocr
- Login: tesseract-ocr
- Kind: organization
- Website: https://github.com/tesseract-ocr/
- Repositories: 14
- Profile: https://github.com/tesseract-ocr
Tesseract OCR
Citation (CITATIONS.bib)
@inproceedings{TableDetect,
author = {Faisal Shafait and Ray Smith},
booktitle = {Document Analysis Systems},
editor = {David S. Doermann and Venu Govindaraju and Daniel P. Lopresti and Premkumar Natarajan},
pages = {65--72},
publisher = {ACM},
series = {ACM International Conference Proceeding Series},
title = {Table detection in heterogeneous documents.},
url = {http://dblp.uni-trier.de/db/conf/das/das2010.html#ShafaitS10},
year = 2010,
isbn = {978-1-60558-773-8},
date = {2010-07-07}
}
@inproceedings{Multilingual,
author = {Ray Smith and Daria Antonova and Dar-Shyang Lee},
booktitle = {MOCR '09: Proceedings of the International Workshop on Multilingual OCR},
editor = {Venu Govindaraju and Premkumar Natarajan and Santanu Chaudhury and Daniel P. Lopresti},
pages = {1--8},
publisher = {ACM},
series = {ACM International Conference Proceeding Series},
title = {Adapting the Tesseract Open Source OCR Engine for Multilingual OCR.},
url = {https://storage.googleapis.com/pub-tools-public-publication-data/pdf/35248.pdf},
year = 2009,
isbn = {978-1-60558-698-4},
date = {2009-07-25},
doi = {http://doi.acm.org/10.1145/1577802.1577804},
location = {Barcelona, Spain},
}
@inproceedings{ScriptDetect,
author = {Ranjith Unnikrishnan and Ray Smith},
title = {Combined Orientation and Script Detection using the Tesseract OCR Engine},
booktitle = {MOCR '09: Proceedings of the International Workshop on Multilingual OCR},
editor = {Venu Govindaraju and Premkumar Natarajan and Santanu Chaudhury and Daniel P. Lopresti},
url = {https://storage.googleapis.com/pub-tools-public-publication-data/pdf/35506.pdf},
year = {2009},
isbn = {978-1-60558-698-4},
pages = {1--7},
location = {Barcelona, Spain},
doi = {http://doi.acm.org/10.1145/1577802.1577809},
publisher = {ACM},
address = {New York, NY, USA},
}
@inproceedings{PageLayout,
author = {Ray Smith},
title = {Hybrid Page Layout Analysis via Tab-Stop Detection},
booktitle = {ICDAR '09: Proceedings of the 2009 10th International Conference on Document Analysis and Recognition},
url = {https://storage.googleapis.com/pub-tools-public-publication-data/pdf/35094.pdf},
year = {2009},
isbn = {978-0-7695-3725-2},
pages = {241--245},
doi = {http://dx.doi.org/10.1109/ICDAR.2009.257},
publisher = {IEEE Computer Society},
address = {Washington, DC, USA},
}
@inproceedings{TessOverview,
author = {Ray Smith},
title = {An Overview of the Tesseract OCR Engine},
booktitle = {ICDAR '07: Proceedings of the Ninth International Conference on Document Analysis and Recognition},
url = {https://storage.googleapis.com/pub-tools-public-publication-data/pdf/33418.pdf},
year = {2007},
isbn = {0-7695-2822-8},
pages = {629--633},
publisher = {IEEE Computer Society},
address = {Washington, DC, USA},
}
Committers
Last synced: 9 months ago
Top Committers
| Name | Email | Commits |
|---|---|---|
| Stefan Weil | sw@w****e | 2,340 |
| Zdenko Podobný | z****p@g****m | 703 |
| Egor Pugin | e****n@g****m | 626 |
| Ray Smith | r****s@g****m | 557 |
| Shree Devi Kumar | s****i@g****m | 203 |
| Amit Dovev | a****2@g****m | 191 |
| Jim O'Regan | j****n@g****m | 131 |
| Noah Metzger | n****r@b****e | 30 |
| Nick White | g****t@n****e | 24 |
| Robert Schubert | r****t@m****m | 22 |
| Raf Schietekat | r****t | 21 |
| david.eger@gmail.com | d****r@g****m@d****0 | 20 |
| Alexander Zaitsev | z****k@t****y | 18 |
| Robin Watts | R****s@a****m | 17 |
| James R. Barlow | j****m@p****a | 17 |
| tmbdev | t****v@d****0 | 16 |
| Tom Morris | t****s@g****m | 14 |
| Shatur95 | g****r@g****m | 12 |
| zhuangzhuang1988 | z****8@o****m | 11 |
| Jeff Breidenbach | b****h@g****m | 10 |
| Robert Pösel | p****l@a****z | 8 |
| Sundar M. Vaidya | s****r@c****m | 8 |
| 林博仁(Buo-ren, Lin) | B****n@g****m | 8 |
| Ger Hobbelt | g****r@h****m | 8 |
| Jaroslaw Kubik | j****k@f****m | 8 |
| max.markin@gmail.com | m****n@g****m@d****0 | 7 |
| mezhirov | m****v@d****0 | 7 |
| chrismamo1 | c****e@g****m | 7 |
| Guido Vranken | g****n | 6 |
| Parryword | 1****d | 6 |
| and 191 more... | | |
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 404
- Total pull requests: 190
- Average time to close issues: 5 months
- Average time to close pull requests: 23 days
- Total issue authors: 314
- Total pull request authors: 62
- Average comments per issue: 7.76
- Average comments per pull request: 3.16
- Merged pull requests: 121
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 88
- Pull requests: 52
- Average time to close issues: 1 day
- Average time to close pull requests: 13 days
- Issue authors: 72
- Pull request authors: 18
- Average comments per issue: 1.99
- Average comments per pull request: 2.79
- Merged pull requests: 33
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- amitdo (22)
- Shreeshrii (11)
- stweil (11)
- l1t1 (6)
- yurivict (4)
- danpla (3)
- bruzzler5 (3)
- nissansz (3)
- DesBw (3)
- marcreichman-pfi (3)
- eddilara (3)
- Balearica (3)
- CanadianHusky (3)
- ghost (3)
- bertsky (2)
Pull Request Authors
- stweil (112)
- zdenop (15)
- LLotme (14)
- sadra-barikbin (13)
- AndersonMartins1 (8)
- Balearica (7)
- bertsky (6)
- heshpdx (6)
- tfmorris (5)
- JKamlah (5)
- GerHobbelt (4)
- ayush-pandey047 (4)
- brad0 (4)
- phymbert (4)
- Parryword (4)
Packages
- Total packages: 100
- Total downloads: unknown
- Total dependent packages: 0 (may contain duplicates)
- Total dependent repositories: 0 (may contain duplicates)
- Total versions: 329
- Total maintainers: 1
All packages below list https://github.com/tesseract-ocr/tesseract/releases as their homepage.

| Registry | Package | Description | License | Latest release | Published |
|---|---|---|---|---|---|
| alpine-v3.22 | tesseract-ocr-data-bel | Tesseract language data for bel | Apache-2.0 | 5.5.0-r2 | 11 months ago |
| alpine-v3.22 | tesseract-ocr-data-aze | Tesseract language data for aze | Apache-2.0 | 5.5.0-r2 | 11 months ago |
| alpine-v3.18 | tesseract-ocr-data-equ | Tesseract language data for equ | Apache-2.0 | 5.3.2-r0 | over 2 years ago |
| alpine-v3.18 | tesseract-ocr-data-msa | Tesseract language data for msa | Apache-2.0 | 5.3.2-r0 | over 2 years ago |
| alpine-v3.18 | tesseract-ocr-data-slk | Tesseract language data for slk | Apache-2.0 | 5.3.2-r0 | over 2 years ago |
| alpine-v3.18 | tesseract-ocr | open source OCR engine | Apache-2.0 | 5.3.2-r0 | over 2 years ago |
| alpine-edge | tesseract-ocr-data-khm | Tesseract language data for khm | Apache-2.0 | 5.5.1-r0 | 8 months ago |
| alpine-edge | tesseract-ocr-data-osd | Tesseract language data for osd | Apache-2.0 | 5.5.1-r0 | 8 months ago |
| alpine-edge | tesseract-ocr-data-bul | Tesseract language data for bul | Apache-2.0 | 5.5.1-r0 | 8 months ago |
| alpine-edge | tesseract-ocr-data-fin | Tesseract language data for fin | Apache-2.0 | 5.5.1-r0 | 8 months ago |
| alpine-edge | tesseract-ocr-data-por | Tesseract language data for por | Apache-2.0 | 5.5.1-r0 | 8 months ago |
| alpine-edge | tesseract-ocr-data-lav | Tesseract language data for lav | Apache-2.0 | 5.5.1-r0 | 8 months ago |
| alpine-edge | tesseract-ocr-data-hin | Tesseract language data for hin | Apache-2.0 | 5.5.1-r0 | 8 months ago |
| alpine-edge | tesseract-ocr-data-nld | Tesseract language data for nld | Apache-2.0 | 5.5.1-r0 | 8 months ago |
| alpine-edge | tesseract-ocr-data-chi_tra | Tesseract language data for chi_tra | Apache-2.0 | 5.5.1-r0 | 8 months ago |
| alpine-edge | tesseract-ocr-data-mlt | Tesseract language data for mlt | Apache-2.0 | 5.5.1-r0 | 8 months ago |
| alpine-edge | tesseract-ocr-data-sqi | Tesseract language data for sqi | Apache-2.0 | 5.5.1-r0 | 8 months ago |
| alpine-edge | tesseract-ocr-dev | open source OCR engine (development files) | Apache-2.0 | 5.5.1-r0 | 8 months ago |
| alpine-edge | tesseract-ocr-data-slv | Tesseract language data for slv | Apache-2.0 | 5.5.1-r0 | 8 months ago |
| alpine-edge | tesseract-ocr-data-swa | Tesseract language data for swa | Apache-2.0 | 5.5.1-r0 | 8 months ago |
| alpine-edge | tesseract-ocr-data-hun | Tesseract language data for hun | Apache-2.0 | 5.5.1-r0 | 8 months ago |
| alpine-edge | tesseract-ocr-data-tel | Tesseract language data for tel | Apache-2.0 | 5.5.1-r0 | 8 months ago |
| alpine-v3.13 | tesseract-ocr-data-deu | Tesseract language data for deu | Apache-2.0 | 4.1.1-r5 | about 5 years ago |
| alpine-v3.12 | tesseract-ocr-data-frk | Tesseract language data for frk | Apache | 4.1.1-r3 | almost 6 years ago |
| alpine-v3.12 | tesseract-ocr-data-cat | Tesseract language data for cat | Apache | 4.1.1-r3 | almost 6 years ago |
| alpine-v3.12 | tesseract-ocr-data-ita_old | Tesseract language data for ita_old | Apache | 4.1.1-r3 | almost 6 years ago |
| alpine-v3.12 | tesseract-ocr-data-spa_old | Tesseract language data for spa_old | Apache | 4.1.1-r3 | almost 6 years ago |
| alpine-v3.12 | tesseract-ocr-data-ita | Tesseract language data for ita | Apache | 4.1.1-r3 | almost 6 years ago |
| alpine-v3.14 | tesseract-ocr-data-mal | Tesseract language data for mal | Apache-2.0 | 4.1.1-r5 | about 5 years ago |
| alpine-v3.14 | tesseract-ocr-data-hin | Tesseract language data for hin | Apache-2.0 | 4.1.1-r5 | about 5 years ago |
| alpine-v3.14 | tesseract-ocr-data-glg | Tesseract language data for glg | Apache-2.0 | 4.1.1-r5 | about 5 years ago |
| alpine-v3.14 | tesseract-ocr-data-lav | Tesseract language data for lav | Apache-2.0 | 4.1.1-r5 | about 5 years ago |
| alpine-v3.14 | tesseract-ocr-data-kor | Tesseract language data for kor | Apache-2.0 | 4.1.1-r5 | about 5 years ago |
| alpine-v3.14 | tesseract-ocr-data-spa_old | Tesseract language data for spa_old | Apache-2.0 | 4.1.1-r5 | about 5 years ago |
| alpine-v3.14 | tesseract-ocr-data-ell | Tesseract language data for ell | Apache-2.0 | 4.1.1-r5 | about 5 years ago |
| alpine-v3.14 | tesseract-ocr-dev | open source OCR engine (development files) | Apache-2.0 | 4.1.1-r5 | about 5 years ago |
| alpine-v3.14 | tesseract-ocr-data-rus | Tesseract language data for rus | Apache-2.0 | 4.1.1-r5 | about 5 years ago |
| alpine-v3.14 | tesseract-ocr-data-grc | Tesseract language data for grc | Apache-2.0 | 4.1.1-r5 | about 5 years ago |
| alpine-v3.11 | tesseract-ocr-data-rus | Tesseract language data for rus | Apache | 4.1.0-r0 | over 6 years ago |
| alpine-v3.11 | tesseract-ocr-data-enm | Tesseract language data for enm | Apache | 4.1.0-r0 | over 6 years ago |
| alpine-v3.11 | tesseract-ocr-data-fin | Tesseract language data for fin | Apache | 4.1.0-r0 | over 6 years ago |
| alpine-v3.9 | tesseract-ocr-data-cat | Tesseract language data for cat | Apache | 4.0.0-r0 | about 7 years ago |
| alpine-v3.9 | tesseract-ocr-data-vie | Tesseract language data for vie | Apache | 4.0.0-r0 | about 7 years ago |
| alpine-v3.9 | tesseract-ocr-data-frk | Tesseract language data for frk | Apache | 4.0.0-r0 | about 7 years ago |
| alpine-v3.9 | tesseract-ocr-data-equ | Tesseract language data for equ | Apache | 4.0.0-r0 | about 7 years ago |
| alpine-v3.9 | tesseract-ocr-data-ara | Tesseract language data for ara | Apache | 4.0.0-r0 | about 7 years ago |
| alpine-v3.9 | tesseract-ocr-data-ita | Tesseract language data for ita | Apache | 4.0.0-r0 | about 7 years ago |
| alpine-v3.7 | tesseract-ocr-data-chi_sim | Tesseract language data for chi_sim | Apache | 3.05.01-r2 | over 8 years ago |
| alpine-v3.7 | tesseract-ocr-data-ita | Tesseract language data for ita | Apache | 3.05.01-r2 | over 8 years ago |
| alpine-v3.7 | tesseract-ocr-data-sqi | Tesseract language data for sqi | Apache | 3.05.01-r2 | over 8 years ago |
| alpine-v3.7 | tesseract-ocr-data-ell | Tesseract language data for ell | Apache | 3.05.01-r2 | over 8 years ago |
| alpine-v3.7 | tesseract-ocr-data-tgl | Tesseract language data for tgl | Apache | 3.05.01-r2 | over 8 years ago |
| alpine-v3.8 | tesseract-ocr-data-msa | Tesseract language data for msa | Apache | 3.05.02-r0 | over 7 years ago |
| alpine-v3.8 | tesseract-ocr-data-srp | Tesseract language data for srp | Apache | 3.05.02-r0 | over 7 years ago |
| alpine-v3.8 | tesseract-ocr-data-fra | Tesseract language data for fra | Apache | 3.05.02-r0 | over 7 years ago |
| alpine-v3.8 | tesseract-ocr-data-chi_sim | Tesseract language data for chi_sim | Apache | 3.05.02-r0 | over 7 years ago |
| alpine-v3.8 | tesseract-ocr-data-kan | Tesseract language data for kan | Apache | 3.05.02-r0 | over 7 years ago |
| alpine-v3.8 | tesseract-ocr-data-hrv | Tesseract language data for hrv | Apache | 3.05.02-r0 | over 7 years ago |
| alpine-v3.8 | tesseract-ocr-doc | open source OCR engine (documentation) | Apache | 3.05.02-r0 | over 7 years ago |
| alpine-v3.8 | tesseract-ocr-data-lit | Tesseract language data for lit | Apache | 3.05.02-r0 | over 7 years ago |
| alpine-v3.8 | tesseract-ocr-data-afr | Tesseract language data for afr | Apache | 3.05.02-r0 | over 7 years ago |
| alpine-v3.15 | tesseract-ocr-data-vie | Tesseract language data for vie | Apache-2.0 | 4.1.3-r0 | over 4 years ago |
| alpine-v3.15 | tesseract-ocr-data-spa | Tesseract language data for spa | Apache-2.0 | 4.1.3-r0 | over 4 years ago |
| alpine-v3.15 | tesseract-ocr-data-aze | Tesseract language data for aze | Apache-2.0 | 4.1.3-r0 | over 4 years ago |
| alpine-v3.15 | tesseract-ocr-data-lav | Tesseract language data for lav | Apache-2.0 | 4.1.3-r0 | over 4 years ago |
| alpine-v3.15 | tesseract-ocr-data-fin | Tesseract language data for fin | Apache-2.0 | 4.1.3-r0 | over 4 years ago |
| alpine-v3.15 | tesseract-ocr-data-tam | Tesseract language data for tam | Apache-2.0 | 4.1.3-r0 | over 4 years ago |
| alpine-v3.15 | tesseract-ocr-data-swa | Tesseract language data for swa | Apache-2.0 | 4.1.3-r0 | over 4 years ago |
| alpine-v3.15 | tesseract-ocr-data-hun | Tesseract language data for hun | Apache-2.0 | 4.1.3-r0 | over 4 years ago |
| alpine-v3.15 | tesseract-ocr-data-bul | Tesseract language data for bul | Apache-2.0 | 4.1.3-r0 | over 4 years ago |
| alpine-v3.15 | tesseract-ocr-data-eus | Tesseract language data for eus | Apache-2.0 | 4.1.3-r0 | over 4 years ago |
| alpine-v3.15 | tesseract-ocr-data-chr | Tesseract language data for chr | Apache-2.0 | 4.1.3-r0 | over 4 years ago |
| alpine-v3.16 | tesseract-ocr-data-frk | Tesseract language data for frk | Apache-2.0 | 5.1.0-r1 | almost 4 years ago |
| alpine-v3.16 | tesseract-ocr-data-isl | Tesseract language data for isl | Apache-2.0 | 5.1.0-r1 | almost 4 years ago |
| alpine-v3.16 | tesseract-ocr-data-lit | Tesseract language data for lit | Apache-2.0 | 5.1.0-r1 | almost 4 years ago |
| alpine-v3.17 | tesseract-ocr-data-nld | Tesseract language data for nld | Apache-2.0 | 5.2.0-r1 | over 3 years ago |
| alpine-v3.17 | tesseract-ocr-data-spa_old | Tesseract language data for spa_old | Apache-2.0 | 5.2.0-r1 | over 3 years ago |
| alpine-v3.17 | tesseract-ocr-data-chi_tra | Tesseract language data for chi_tra | Apache-2.0 | 5.2.0-r1 | over 3 years ago |
| alpine-v3.10 | tesseract-ocr-data-vie | Tesseract language data for vie | Apache | 4.0.0-r2 | almost 7 years ago |
| alpine-v3.22 | tesseract-ocr-data-slk | Tesseract language data for slk | Apache-2.0 | 5.5.0-r2 | 11 months ago |
| alpine-v3.22 | tesseract-ocr-data-ita_old | Tesseract language data for ita_old | Apache-2.0 | 5.5.0-r2 | 11 months ago |
| alpine-v3.22 | tesseract-ocr-data-nld | Tesseract language data for nld | Apache-2.0 | 5.5.0-r2 | 11 months ago |
| alpine-v3.22 | tesseract-ocr-data-vie | Tesseract language data for vie | Apache-2.0 | 5.5.0-r2 | 11 months ago |
| alpine-v3.22 | tesseract-ocr-data-spa | Tesseract language data for spa | Apache-2.0 | 5.5.0-r2 | 11 months ago |
| alpine-v3.22 | tesseract-ocr-data-kor | Tesseract language data for kor | Apache-2.0 | 5.5.0-r2 | 11 months ago |
| alpine-v3.22 | tesseract-ocr-data-tha | Tesseract language data for tha | Apache-2.0 | 5.5.0-r2 | 11 months ago |
| alpine-v3.19 | tesseract-ocr-data-khm | Tesseract language data for khm | Apache-2.0 | 5.3.3-r1 | over 2 years ago |
| alpine-v3.22 | tesseract-ocr-data-lit | Tesseract language data for lit | Apache-2.0 | 5.5.0-r2 | 11 months ago |
| alpine-v3.22 | tesseract-ocr-data-frm | Tesseract language data for frm | Apache-2.0 | 5.5.0-r2 | 11 months ago |
| alpine-v3.22 | tesseract-ocr-data-por | Tesseract language data for por | Apache-2.0 | 5.5.0-r2 | 11 months ago |
| alpine-v3.22 | tesseract-ocr-data-chi_sim | Tesseract language data for chi_sim | Apache-2.0 | 5.5.0-r2 | 11 months ago |
| alpine-v3.22 | tesseract-ocr-data-hin | Tesseract language data for hin | Apache-2.0 | 5.5.0-r2 | 11 months ago |
| alpine-v3.22 | tesseract-ocr-data-ron | Tesseract language data for ron | Apache-2.0 | 5.5.0-r2 | 11 months ago |
| alpine-v3.22 | tesseract-ocr-data-chi_tra | Tesseract language data for chi_tra | Apache-2.0 | 5.5.0-r2 | 11 months ago |
| alpine-v3.22 | tesseract-ocr-data-jpn | Tesseract language data for jpn | Apache-2.0 | 5.5.0-r2 | 11 months ago |
| alpine-v3.22 | tesseract-ocr-data-dan | Tesseract language data for dan | Apache-2.0 | 5.5.0-r2 | 11 months ago |
| alpine-v3.22 | tesseract-ocr-data-khm | Tesseract language data for khm | Apache-2.0 | 5.5.0-r2 | 11 months ago |
| alpine-v3.22 | tesseract-ocr-data-tgl | Tesseract language data for tgl | Apache-2.0 | 5.5.0-r2 | 11 months ago |
| alpine-v3.20 | tesseract-ocr-data-ita_old | Tesseract language data for ita_old | Apache-2.0 | 5.3.4-r0 | about 2 years ago |
| alpine-v3.19 | tesseract-ocr-data-mkd | Tesseract language data for mkd | Apache-2.0 | 5.3.3-r1 | over 2 years ago |
Dependencies
- actions/checkout v3 composite
- actions/checkout v3 composite
- actions/checkout v3 composite
- actions/upload-artifact v3 composite
- google/oss-fuzz/infra/cifuzz/actions/build_fuzzers master composite
- google/oss-fuzz/infra/cifuzz/actions/run_fuzzers master composite
- actions/checkout v3 composite
- actions/upload-artifact v3.1.1 composite
- ilammy/setup-nasm v1 composite
- actions/checkout v3 composite
- actions/checkout v3 composite
- github/codeql-action/analyze v2 composite
- github/codeql-action/init v2 composite
- msys2/setup-msys2 v2 composite
- actions/checkout v3 composite
- msys2/setup-msys2 v2 composite
- actions/checkout v3 composite
- actions/upload-artifact v3 composite
- egorpugin/sw-action master composite
- mikepenz/action-junit-report v3 composite
- actions/checkout v3 composite
- actions/checkout v3 composite
- actions/checkout v3 composite
- actions/checkout v3 composite
- actions/checkout v3 composite
- ubuntu latest build