document-layout-analysis

Object Detection Model for Scanned Documents

https://github.com/lynnhado/document-layout-analysis

Science Score: 57.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
✓
DOI references
Found 3 DOI reference(s) in README
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (13.7%) to scientific vocabulary

Keywords

document-understanding object-detection python yolov8

Last synced: 11 months ago · JSON representation ·

Repository

Object Detection Model for Scanned Documents

Basic Info

Host: GitHub
Owner: LynnHaDo
License: agpl-3.0
Language: Jupyter Notebook
Default Branch: master
Homepage: https://huggingface.co/spaces/linhdo/document-layout-analysis
Size: 2.86 MB

Statistics

Stars: 90
Watchers: 4
Forks: 14
Open Issues: 1
Releases: 0

Topics

document-understanding object-detection python yolov8

Created about 3 years ago · Last pushed over 1 year ago

Metadata Files

Readme License Citation

Layout Analysis of Scanned Documents

Document Layout Analysis using YOLOv8
View Demo · Report Bug · Request Feature

Table of Contents

Updates
About The Project
- Built With
Getting Started
- Prerequisites
- Installation
Works Cited
Acknowledgments

Updates

In this project, I provided 1 object detection model trained on the existing YOLOv8 weights. They are uploaded in my Hugging Face Space of the project. If you feel the need to use or fine-tune the models in any parts of your work, please cite this repository. Thank you, and don't forget to give this repo a 🌟!

About The Project

Due to the lack of computational resources, I only performed the training process on the Doclaynet-base dataset which contains 6910 train images, 648 val images, 499 test images. However, the model could perform relatively well, further proving the superiority of YOLOv8 model.

(back to top)

Built With

(back to top)

Prerequisites

python 3
ultralytics
numpy
opencv-python

Installation

Clone the repo sh git clone https://github.com/LynnHaDo/Document-Layout-Analysis.git
Install packages sh pip install ultralytics pip install numpy pip install opencv-python
Download Doclaynet dataset and save it as datasets/doclaynet-base
(Optional) Download pretrained YOLOv8s weights

(back to top)

Works Cited

Ultralytics YOLOv8

sh authors: - family-names: Jocher given-names: Glenn orcid: "https://orcid.org/0000-0001-5950-6979" - family-names: Chaurasia given-names: Ayush orcid: "https://orcid.org/0000-0002-7603-6750" - family-names: Qiu given-names: Jing orcid: "https://orcid.org/0000-0003-3783-7069" title: "YOLO by Ultralytics" version: 8.0.0 date-released: 2023-1-10 license: AGPL-3.0 url: "https://github.com/ultralytics/ultralytics"

Doclaynet-base dataset

sh @article{doclaynet2022, title = {DocLayNet: A Large Human-Annotated Dataset for Document-Layout Segmentation}, doi = {10.1145/3534678.353904}, url = {https://doi.org/10.1145/3534678.3539043}, author = {Pfitzmann, Birgit and Auer, Christoph and Dolfi, Michele and Nassar, Ahmed S and Staar, Peter W J}, year = {2022}, isbn = {9781450393850}, publisher = {Association for Computing Machinery}, address = {New York, NY, USA}, booktitle = {Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining}, pages = {3743–3751}, numpages = {9}, location = {Washington DC, USA}, series = {KDD '22} }

Contact

Linh Do - do24l@mtholyoke.edu/dohalinh2303@gmail.com (personal)

Project Link: https://github.com/LynnHaDo/Document-Layout-Analysis

LinkedIn: https://linkedin.com/in/Linh Do

(back to top)

Owner

Name: Lynn Do
Login: LynnHaDo
Kind: user

Repositories: 1
Profile: https://github.com/LynnHaDo

Mount Holyoke College '26

Citation (CITATION.cff)

# This CITATION.cff file was generated with cffinit.
# Visit https://bit.ly/cffinit to generate yours today!

cff-version: 1.2.0
title: Document Layout Analysis Baseline Model
message: >-
  If you use this software, please cite it using the
  metadata from this file.
type: software
authors:
  - given-names: Linh
    family-names: Do
    email: do24l@mtholyoke.edu
repository-code: 'https://github.com/LynnHaDo/Document-Layout-Analysis'
url: >-
  https://huggingface.co/spaces/linhdo/document-layout-analysis
abstract: >-
  Baseline Object Detection Model trained on Doclaynet
  dataset
license: MIT
commit: a44264d

GitHub Events

Total

Issues event: 1
Watch event: 17
Push event: 2
Fork event: 3

Last Year

Issues event: 1
Watch event: 17
Push event: 2
Fork event: 3

Issues and Pull Requests

Last synced: 11 months ago

All Time

Total issues: 1
Total pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Total issue authors: 1
Total pull request authors: 0
Average comments per issue: 0.0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 1
Pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Issue authors: 1
Pull request authors: 0
Average comments per issue: 0.0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science

document-layout-analysis

Science Score: 57.0%

Keywords

Repository

Basic Info

Statistics

Topics

Metadata Files

README.md

Layout Analysis of Scanned Documents

Updates

About The Project

Built With

Prerequisites

Installation

Works Cited

Contact

Owner

Citation (CITATION.cff)

GitHub Events

Total

Last Year

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels