DocAligner
The core functionality of this project is called "Document Localization."
📄️ Introduction
This task is essentially a "precursor" to OCR tasks.
📄️ Installation
Currently, there is no package available on PyPI, and there are no plans to offer one in the near future.
📄️ QuickStart
We provide a simple model inference interface, which includes both pre-processing and post-processing logic.
📄️ Advanced
When invoking the DocAligner model, you can make advanced settings by passing parameters.
📄️ Model Design
Referencing past research, we first considered a point regression model.
📄️ Evaluation
We utilized the SmartDoc 2015 dataset for our testing.
📄️ Discussion
Based on our experiments, we have developed a model that performs quite well.
📄️ Dataset
In this chapter, we briefly introduce the datasets used for training and testing our models. These datasets include a variety of document images.
📄️ Submission
In the real world, you are bound to encounter situations where things don't work as expected.
📄️ References
The literature on this topic is sparse; we've compiled some of the more representative papers to serve as foundational material for research.