https://github.com/bertsky/sbb_binarization

Binarize document images

https://github.com/bertsky/sbb_binarization

Science Score: 10.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: arxiv.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (7.4%) to scientific vocabulary
Last synced: 10 months ago · JSON representation

Repository

Binarize document images

Basic Info
  • Host: GitHub
  • Owner: bertsky
  • License: apache-2.0
  • Default Branch: master
  • Homepage:
  • Size: 146 KB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Fork of qurator-spk/sbb_binarization
Created over 5 years ago · Last pushed over 1 year ago

https://github.com/bertsky/sbb_binarization/blob/master/

# Binarization

> Binarization for document images

## Examples



## Introduction

This tool performs document image binarization (i.e. transform colour/grayscale
to black-and-white pixels) for OCR using multiple trained models. 

The method used is based on _Calvo-Zaragoza/Gallego, 2018. [A selectional auto-encoder approach for document image binarization](https://arxiv.org/abs/1706.10241)_.

## Installation

Clone the repository, enter it and run

`pip install .`

### Models

Pre-trained models can be downloaded from here:   

https://qurator-data.de/sbb_binarization/

## Usage

```sh
sbb_binarize \
  --patches \
  -m  \
   \
  
```

**Note** In virtually all cases, the `--patches` flag will improve results.

Owner

  • Name: Robert Sachunsky
  • Login: bertsky
  • Kind: user

GitHub Events

Total
  • Push event: 3
Last Year
  • Push event: 3