DocAligner
The core functionality of this project is called "Document Localization."
ποΈ Introduction
This task is essentially a "precursor" to OCR tasks.
ποΈ Installation
Currently, there is no package available on PyPI, and there are no plans to offer one in the near future.
ποΈ QuickStart
We provide a simple model inference interface, which includes both pre-processing and post-processing logic.
ποΈ Advanced
When invoking the DocAligner model, you can make advanced settings by passing parameters.
ποΈ Model Design
Referencing past research, we first considered a point regression model.
ποΈ Evaluation
We utilized the SmartDoc 2015 dataset for our testing.
ποΈ Discussion
Based on our experiments, we have developed a model that performs quite well.
ποΈ Dataset
In this chapter, we briefly introduce the datasets used for training and testing our models. These datasets include a variety of document images.
ποΈ Submission
In the real world, you are bound to encounter situations where things don't work as expected.
ποΈ References
The literature on this topic is sparse; we've compiled some of the more representative papers to serve as foundational material for research.