bachelors-thesis

Bachelor's thesis on Automatic License Plate Recognition.

https://github.com/aurosevic/bachelors-thesis

Science Score: 31.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (5.9%) to scientific vocabulary

Keywords

alpr anpr license-plate-detection license-plate-recognition ocr text-detection text-recognition
Last synced: 6 months ago · JSON representation ·

Repository

Bachelor's thesis on Automatic License Plate Recognition.

Basic Info
  • Host: GitHub
  • Owner: aurosevic
  • License: mit
  • Language: TeX
  • Default Branch: main
  • Homepage:
  • Size: 39.9 MB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Topics
alpr anpr license-plate-detection license-plate-recognition ocr text-detection text-recognition
Created almost 2 years ago · Last pushed over 1 year ago
Metadata Files
Readme License Citation

README.md

Bachelor's Thesis on Automatic License Plate Recognition using a Single Visual Model

Building the Project and Generating the PDF

zsh biber bachelors-thesis pdflatex bachelors-thesis.tex

Thesis Implementation

For an implementation related to this thesis, please refer to this GitHub repository.

Note: The linked repository may be private at the time of your visit. If you have trouble accessing the repository, feel free to reach out to me for permissions.

Results

Terminal Output Upon Completion of Training

train-code-real-and-synthetic-data

Training Results

Screenshot_20241102_131136

Owner

  • Name: Andrija Urosevic
  • Login: aurosevic
  • Kind: user

Citation (citation.bib)

@misc{shi2015endtoend,
	title={An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition}, 
	author={Baoguang Shi and Xiang Bai and Cong Yao},
	year={2015},
	eprint={1507.05717},
	archivePrefix={arXiv},
	primaryClass={cs.CV}
}

@misc{sheng2019nrtr,
	title={NRTR: A No-Recurrence Sequence-to-Sequence Model For Scene Text Recognition}, 
	author={Fenfen Sheng and Zhineng Chen and Bo Xu},
	year={2019},
	eprint={1806.00926},
	archivePrefix={arXiv},
	primaryClass={cs.CV}
}

@misc{li2019show,
	title={Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition}, 
	author={Hui Li and Peng Wang and Chunhua Shen and Guyu Zhang},
	year={2019},
	eprint={1811.00751},
	archivePrefix={arXiv},
	primaryClass={cs.CV}
}

@misc{zheng2023cdistnet,
	title={CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition}, 
	author={Tianlun Zheng and Zhineng Chen and Shancheng Fang and Hongtao Xie and Yu-Gang Jiang},
	year={2023},
	eprint={2111.11011},
	archivePrefix={arXiv},
	primaryClass={cs.CV}
}

@misc{yu2020accuratescenetextrecognition,
	title={Towards Accurate Scene Text Recognition with Semantic Reasoning Networks}, 
	author={Deli Yu and Xuan Li and Chengquan Zhang and Junyu Han and Jingtuo Liu and Errui Ding},
	year={2020},
	eprint={2003.12294},
	archivePrefix={arXiv},
	primaryClass={cs.CV},
	url={https://arxiv.org/abs/2003.12294}, 
}

@misc{fang2021readlikehumansautonomous,
	title={Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition}, 
	author={Shancheng Fang and Hongtao Xie and Yuxin Wang and Zhendong Mao and Yongdong Zhang},
	year={2021},
	eprint={2103.06495},
	archivePrefix={arXiv},
	primaryClass={cs.CV},
	url={https://arxiv.org/abs/2103.06495}, 
}	

@article{Hu_Cai_Hou_Yi_Lin_2020,
	title={GTC: Guided Training of CTC towards Efficient and Accurate Scene Text Recognition},
	volume={34},
	url={https://ojs.aaai.org/index.php/AAAI/article/view/6735},
	DOI={10.1609/aaai.v34i07.6735},
	abstractNote={<p>Connectionist Temporal Classification (CTC) and attention mechanism are two main approaches used in recent scene text recognition works. Compared with attention-based methods, CTC decoder has a much shorter inference time, yet a lower accuracy. To design an efficient and effective model, we propose the guided training of CTC (GTC), where CTC model learns a better alignment and feature representations from a more powerful attentional guidance. With the benefit of guided training, CTC model achieves robust and accurate prediction for both regular and irregular scene text while maintaining a fast inference speed. Moreover, to further leverage the potential of CTC decoder, a graph convolutional network (GCN) is proposed to learn the local correlations of extracted features. Extensive experiments on standard benchmarks demonstrate that our end-to-end model achieves a new state-of-the-art for regular and irregular scene text recognition and needs 6 times shorter inference time than attention-based methods.</p>},
	number={07},
	journal={Proceedings of the AAAI Conference on Artificial Intelligence},
	author={Hu, Wenyang and Cai, Xiaocong and Hou, Jun and Yi, Shuai and Lin, Zhiping},
	year={2020},
	month={4},
	pages={11005-11012}
}

@misc{dosovitskiy2021imageworth16x16words,
	title={An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale}, 
	author={Alexey Dosovitskiy and Lucas Beyer and Alexander Kolesnikov and Dirk Weissenborn and Xiaohua Zhai and Thomas Unterthiner and Mostafa Dehghani and Matthias Minderer and Georg Heigold and Sylvain Gelly and Jakob Uszkoreit and Neil Houlsby},
	year={2021},
	eprint={2010.11929},
	archivePrefix={arXiv},
	primaryClass={cs.CV},
	url={https://arxiv.org/abs/2010.11929}, 
}

@misc{liu2021swintransformerhierarchicalvision,
	title={Swin Transformer: Hierarchical Vision Transformer using Shifted Windows}, 
	author={Ze Liu and Yutong Lin and Yue Cao and Han Hu and Yixuan Wei and Zheng Zhang and Stephen Lin and Baining Guo},
	year={2021},
	eprint={2103.14030},
	archivePrefix={arXiv},
	primaryClass={cs.CV},
	url={https://arxiv.org/abs/2103.14030}, 
}

@misc{du2022svtrscenetextrecognition,
	title={SVTR: Scene Text Recognition with a Single Visual Model}, 
	author={Yongkun Du and Zhineng Chen and Caiyan Jia and Xiaoting Yin and Tianlun Zheng and Chenxia Li and Yuning Du and Yu-Gang Jiang},
	year={2022},
	eprint={2205.00159},
	archivePrefix={arXiv},
	primaryClass={cs.CV},
	url={https://arxiv.org/abs/2205.00159}, 
}

@inproceedings{inproceedings,
	author = {Graves, Alex and Fernández, Santiago and Gomez, Faustino and Schmidhuber, Jürgen},
	year = {2006},
	month = {01},
	pages = {369-376},
	title = {Connectionist temporal classification: Labelling unsegmented sequence data with recurrent neural 'networks},
	volume = {2006},
	journal = {ICML 2006 - Proceedings of the 23rd International Conference on Machine Learning},
	doi = {10.1145/1143844.1143891}
}

@misc{hu2020gtcguidedtrainingctc,
	title={GTC: Guided Training of CTC Towards Efficient and Accurate Scene Text Recognition}, 
	author={Wenyang Hu and Xiaocong Cai and Jun Hou and Shuai Yi and Zhiping Lin},
	year={2020},
	eprint={2002.01276},
	archivePrefix={arXiv},
	primaryClass={cs.CV},
	url={https://arxiv.org/abs/2002.01276}, 
}

@misc{fang2021looksequencerethinkingtransformer,
	title={You Only Look at One Sequence: Rethinking Transformer in Vision through Object Detection}, 
	author={Yuxin Fang and Bencheng Liao and Xinggang Wang and Jiemin Fang and Jiyang Qi and Rui Wu and Jianwei Niu and Wenyu Liu},
	year={2021},
	eprint={2106.00666},
	archivePrefix={arXiv},
	primaryClass={cs.CV},
	url={https://arxiv.org/abs/2106.00666}, 
}

@misc{baek2019characterregionawarenesstext,
	title={Character Region Awareness for Text Detection}, 
	author={Youngmin Baek and Bado Lee and Dongyoon Han and Sangdoo Yun and Hwalsuk Lee},
	year={2019},
	eprint={1904.01941},
	archivePrefix={arXiv},
	primaryClass={cs.CV},
	url={https://arxiv.org/abs/1904.01941}, 
}

@book{baidu2022dio,
	title = {Dive Into OCR},
	author = {Chenxia Li and Weiwei Liu and Ruoyu Guo and Xiaoting Yin and Kaitao Jiang and Yongkun Du and Yuning Du and Lingfeng Zhu and Runjie Jin and Keying Liu and Yehua Yang and Ran Bi and Xiaoguang Hu and Dianhai Yu and Yanjun Ma},
	year = {2022},
	publisher = {Baidu}
}

@misc{briefHistoryOfOCR,
	title = {The Brief History of OCR Technology},
	author = {Pankaj Tripathi},
	howpublished = {\url{https://www.docsumo.com/blog/optical-character-recognition-history}},
	year = {2024},
	note = {Accessed: 15.10.2024.}
}

@misc{historyOfOCR,
	title = {The History of OCR},
	author = {Dave Van Everen},
	howpublished = {\url{https://www.veryfi.com/ocr-api-platform/history-of-ocr/}},
	year = {2023},
	note = {Accessed: 15.10.2024.}
}

@article{article2021,
	author = {Wang, Haifeng and Pan, Changzai and Guo, Xiao and Ji, Chunlin and Deng, Ke},
	year = {2021},
	month = {01},
	pages = {},
	title = {From object detection to text detection and recognition: A brief evolution history of optical character recognition},
	volume = {13},
	journal = {Wiley Interdisciplinary Reviews: Computational Statistics},
	doi = {10.1002/wics.1547}
}

@misc{srbRegistracija,
	title = {Gradovi u Srbiji sa pravom na registrovanje vozila},
	author = {Super registracija},
	howpublished = {\url{https://www.super-registracija-vozila.rs/registarske-oznake-u-srbiji/}},
	note = {Accessed: 15.10.2024.}
}

@misc{perplexityUpitZaPrimene,
	title = {Asked a query on perplexity.ai, which uses LLM.},
	author = {Perplexity AI},
	note = {Query used with the following prompt: \enquote{Da li možeš da mi navedeš nekoliko primera primene i upotrebe automatskog prepoznavanja tablica automobila}. Query executed on the site: https://www.perplexity.ai/. Date: 29.07.2024.}
}

@misc{UFPRALPRDataset,
	author = {Rayson Laroca},
	title = {UFPR-ALPR Dataset},
	year = {2024},
	url = {https://github.com/raysonlaroca/ufpr-alpr-dataset},
}

@misc{CCPD,
	author = {zhy},
	title = {CCPD (Chinese City Parking Dataset, ECCV)},
	year = {2020},
	url = {https://github.com/detectRecog/CCPD},
}

@misc{DatasetNinja,
	author = {Dataset Ninja},
	title = {Car License Plate Dataset},
	url = {https://datasetninja.com/car-license-plate},
	note = {Accessed: 20.10.2024.}
}

@misc{LicensePlatesRecognitionDatasetRoboflow,
	author = {Roboflow},
	title = {License Plates Recognition Dataset},
	url = {https://universe.roboflow.com/objectdetection-jhgr1/license-plates-recognition/dataset/2},
	note = {Accessed: 20.10.2024.}
}

GitHub Events

Total
  • Public event: 1
  • Push event: 1
Last Year
  • Public event: 1
  • Push event: 1