custom-object-character-recognition-ocr-on-aws

https://github.com/pankajdevikar/custom-object-character-recognition-ocr-on-aws

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (9.4%) to scientific vocabulary

Last synced: 6 months ago · JSON representation ·

Repository

Basic Info

Host: GitHub
Owner: PankajDevikar
License: agpl-3.0
Language: Python
Default Branch: main
Size: 3.77 MB

Statistics

Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 0

Created 12 months ago · Last pushed 12 months ago

Metadata Files

Readme Contributing License Citation

Custom Object Character Recognition (OCR) on AWS Using Appwrite Integration

Building a Custom OCR System by Combining YOLOv3, Tesseract, and Appwrite

1. Overview

This project implements a custom object character recognition (OCR) system designed to extract key information from lab reports. The system integrates a YOLOv3-based text detection model with the Tesseract OCR engine, and it uses Appwrite for cloud storage and API management. The solution automates the process of converting lab reports into editable text, and it is deployed on AWS for scalability.

2. Features

YOLOv3 Text Detection:
Detects regions of interest (text regions) within lab report images.
Tesseract OCR Integration:
Extracts text from the detected regions.
Appwrite Integration:
Utilizes Appwrite for cloud-based data management and API endpoints.
AWS Deployment:
Designed to run on AWS for scalable and cost-efficient processing.
User Interface:
A Streamlit app provides an interactive interface for uploading images, viewing detections, and displaying OCR results.

3. Architecture

3.1 Overall System Diagram

3.2 Detailed Architecture

The system consists of three main components: - Detection Module:
Uses a YOLOv3-based model to detect text regions. - OCR Module:
Uses Tesseract OCR to extract text from detected regions. - Cloud & API Module:
Appwrite manages storage and API endpoints for scalable data handling.

3.3 Appwrite Integration

Appwrite is used to store training data, model outputs, and to provide RESTful API endpoints for accessing results. This enables real-time updates and monitoring through a user-friendly dashboard.

4. Installation

Prerequisites

Python 3.x
Virtual environment (recommended)
AWS Account and proper credentials
Appwrite instance for backend management

📸 Output Screenshots

Below are the screenshots demonstrating the project output:

Owner

Login: PankajDevikar
Kind: user

Repositories: 1
Profile: https://github.com/PankajDevikar

Citation (CITATION.cff)

cff-version: 1.2.0
preferred-citation:
  type: software
  message: If you use YOLOv5, please cite it as below.
  authors:
    - family-names: Jocher
      given-names: Glenn
      orcid: "https://orcid.org/0000-0001-5950-6979"
  title: "YOLOv5 by Ultralytics"
  version: 7.0
  doi: 10.5281/zenodo.3908559
  date-released: 2020-5-29
  license: AGPL-3.0
  url: "https://github.com/ultralytics/yolov5"

GitHub Events

Total

Push event: 9
Create event: 2

Last Year

Push event: 9
Create event: 2

Dependencies

pyproject.toml pypi

matplotlib >=3.3.0
numpy >=1.22.2
opencv-python >=4.6.0
pandas >=1.1.4
pillow >=7.1.2
psutil *
py-cpuinfo *
pyyaml >=5.3.1
requests >=2.23.0
scipy >=1.4.1
seaborn >=0.11.0
thop >=0.1.1
torch >=1.8.0
torchvision >=0.9.0
tqdm >=4.64.0
ultralytics >=8.0.232

requirements.txt pypi

Pillow >=10.3.0
PyYAML >=5.3.1
gitpython >=3.1.30
matplotlib >=3.3
numpy >=1.23.5
opencv-python >=4.1.1
pandas >=1.1.4
psutil *
requests >=2.32.2
scipy >=1.4.1
seaborn >=0.11.0
setuptools >=70.0.0
thop >=0.1.1
torchvision >=0.9.0
tqdm >=4.66.3

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science