https://github.com/bertsky/ocrd_jdeskew

OCR-D wrapper for Document Image Skew Estimation using Adaptive Radial Projection

https://github.com/bertsky/ocrd_jdeskew

Science Score: 23.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
    1 of 1 committers (100.0%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.3%) to scientific vocabulary

Keywords

ocr-d
Last synced: 5 months ago · JSON representation

Repository

OCR-D wrapper for Document Image Skew Estimation using Adaptive Radial Projection

Basic Info
  • Host: GitHub
  • Owner: bertsky
  • License: mit
  • Language: Python
  • Default Branch: master
  • Homepage:
  • Size: 8.79 KB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Topics
ocr-d
Created over 2 years ago · Last pushed over 2 years ago
Metadata Files
Readme License

README.md

PyPI version

ocrd_jdeskew

OCR-D wrapper for Document Image Skew Estimation using Adaptive Radial Projection

Introduction

This offers an OCR-D compliant workspace processor for jdeskew.

Installation

Create and activate a virtual environment as usual.

To install this module along with its dependencies, do:

pip install .

Usage

OCR-D processor interface ocrd-jdeskew

To be used with PAGE-XML documents in an OCR-D annotation workflow.

``` Usage: ocrd-jdeskew [OPTIONS]

Deskew pages / regions with jdeskew

Deskew the regions of the workspace.

Open and deserialise PAGE input files and their respective images, then iterate over the element hierarchy down to the requested level-of-operation.

Next, for each segment, crop an image according to its layout annotation (via coordinates into the higher-level image, or from an existing alternative image), and determine optimal the deskewing angle for it (up to maxskew). Annotate the angle in the page or region.

Derotate the image, and add the new image file to the workspace along with the output fileGrp, and using a file ID with suffix .IMG-DESKEW along with further identification of the segment.

Produce a new output file by serialising the resulting hierarchy.

Options for processing: -m, --mets URL-PATH URL or file path of METS to process [./mets.xml] -w, --working-dir PATH Working directory of local workspace [dirname(URL-PATH)] -I, --input-file-grp USE File group(s) used as input -O, --output-file-grp USE File group(s) used as output -g, --page-id ID Physical page ID(s) to process instead of full document [] --overwrite Remove existing output pages/images (with "--page-id", remove only those) --profile Enable profiling --profile-file PROF-PATH Write cProfile stats to PROF-PATH. Implies "--profile" -p, --parameter JSON-PATH Parameters, either verbatim JSON string or JSON file path -P, --param-override KEY VAL Override a single JSON object key-value pair, taking precedence over "--parameter" -l, --log-level [OFF|ERROR|WARN|INFO|DEBUG|TRACE] Override log level globally [INFO]

Options for Processing Worker server: --queue The RabbitMQ server address in format "amqp://{user}:{pass}@{host}:{port}/{vhost}" [amqp://admin:admin@localhost:5672] --database The MongoDB server address in format "mongodb://{host}:{port}" [mongodb://localhost:27018] --type type of processing: either "worker" or "server"

Options for information: -C, --show-resource RESNAME Dump the content of processor resource RESNAME -L, --list-resources List names of processor resources -J, --dump-json Dump tool description as JSON -D, --dump-module-dir Show the 'module' resource location path for this processor -h, --help Show this message -V, --version Show version

Parameters: "maxskew" [number] modulus of maximum skewing angle (in degrees) to detect "level-of-operation" [string - "page"] PAGE XML hierarchy level granularity to annotate orientation and images for Possible values: ["page", "region"]

```

Owner

  • Name: Robert Sachunsky
  • Login: bertsky
  • Kind: user

GitHub Events

Total
Last Year

Committers

Last synced: 11 months ago

All Time
  • Total Commits: 3
  • Total Committers: 1
  • Avg Commits per committer: 3.0
  • Development Distribution Score (DDS): 0.0
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Robert Sachunsky s****y@i****e 3
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 0
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 0
  • Total pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 11 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 1
  • Total maintainers: 1
pypi.org: ocrd-jdeskew

OCR-D wrapper for Document Image Skew Estimation using Adaptive Radial Projection

  • Documentation: https://ocrd-jdeskew.readthedocs.io/
  • License: MIT License Copyright (c) 2022 Luan Pham Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions: The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software. THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
  • Latest release: 0.0.2
    published over 2 years ago
  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 11 Last month
Rankings
Dependent packages count: 7.2%
Average: 24.3%
Dependent repos count: 41.3%
Maintainers (1)
Last synced: 6 months ago