DOCSAID

A playground for our developers.

Open Source Projects

Explore our open-source projects and drive AI innovation with us.

Technical Documentation

We share insights to clarify our vision and enhance development.

Research Analysis

We share notes to reveal our thoughts from research papers.

Featured Projects

Capybara

Image processing, format conversion, and search tools powered by OpenCV.

WordCanvas

Text image rendering tool based on Pillow for generating random text images.

DocAligner

Detects documents in images and precisely locates four corner points.

DocClassifier

Image classification system integrating contrastive learning and PartialFC.

MRZScanner

Detects and extracts MRZ text regions with 99.97% text similarity.

Chameleon

A deep learning toolbox based on PyTorch, integrating Timm and custom training modules.

AutoTraderX

Automated trading, order placement, and quotes via Taiwan securities API.

GmailSummary

Automates email summarization and organization with Gmail API and OpenAI API.


Features

Below are demos of two models: DocAligner for document alignment and MRZScanner for machine-readable zone recognition.

DocAligner Demo

Upload an image containing a document to detect its corners and perform perspective correction.
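
The perspective-correction step can be illustrated with a small sketch. This is not DocAligner's code or API; it only shows the underlying geometry, assuming the four corners are already known. The function names (`solve_linear`, `homography`, `warp_point`) and the sample coordinates are hypothetical. Given four detected corners and the four corners of the desired upright rectangle, a 3x3 homography can be solved from eight linear equations.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def solve_linear(a: List[List[float]], b: List[float]) -> List[float]:
    """Solve a @ x = b with Gauss-Jordan elimination and partial pivoting."""
    n = len(b)
    m = [list(row) + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("system is singular; corners may be degenerate")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                for c in range(col, n + 1):
                    m[r][c] -= factor * m[col][c]
    return [m[i][n] / m[i][i] for i in range(n)]


def homography(src: Sequence[Point], dst: Sequence[Point]) -> List[List[float]]:
    """Return the 3x3 matrix mapping each src corner to its dst corner.

    The bottom-right entry is fixed to 1, leaving eight unknowns.
    """
    rows, rhs = [], []
    for (x, y), (u, v) in zip(src, dst):
        rows.append([x, y, 1.0, 0.0, 0.0, 0.0, -u * x, -u * y])
        rhs.append(u)
        rows.append([0.0, 0.0, 0.0, x, y, 1.0, -v * x, -v * y])
        rhs.append(v)
    h = solve_linear(rows, rhs)
    return [h[0:3], h[3:6], [h[6], h[7], 1.0]]


def warp_point(h: List[List[float]], p: Point) -> Point:
    """Apply the homography to one point (with the projective divide)."""
    x, y = p
    w = h[2][0] * x + h[2][1] * y + h[2][2]
    return (
        (h[0][0] * x + h[0][1] * y + h[0][2]) / w,
        (h[1][0] * x + h[1][1] * y + h[1][2]) / w,
    )


# Hypothetical corners (top-left, top-right, bottom-right, bottom-left)
# and a 400x600 target rectangle.
corners = [(120.0, 80.0), (520.0, 140.0), (480.0, 620.0), (60.0, 540.0)]
target = [(0.0, 0.0), (400.0, 0.0), (400.0, 600.0), (0.0, 600.0)]
H = homography(corners, target)
```

In practice, an image library such as OpenCV would compute the same matrix and resample every pixel; the sketch only maps individual points.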

Test Images: Text Interference, Partial Occlusion, Strong Reflection, Low Light Scene, Highly Skewed

MRZScanner Demo

Upload an image with MRZ (Machine Readable Zone) to detect and parse the text content.
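
One part of parsing MRZ text is validating its check digits. The sketch below is not MRZScanner's API; the function names `char_value` and `check_digit` are hypothetical. It follows the check-digit rule defined in ICAO Doc 9303: digits keep their value, letters A to Z map to 10 to 35, the filler `<` counts as 0, and the values are weighted 7, 3, 1 repeatedly before taking the sum modulo 10.

```python
def char_value(ch: str) -> int:
    """Numeric value of one MRZ character under the ICAO 9303 rule."""
    if "0" <= ch <= "9":
        return ord(ch) - ord("0")
    if "A" <= ch <= "Z":
        return ord(ch) - ord("A") + 10
    if ch == "<":
        return 0
    raise ValueError(f"invalid MRZ character: {ch!r}")


def check_digit(field: str) -> int:
    """Weighted sum (7, 3, 1 repeating) of the field's values, modulo 10."""
    weights = (7, 3, 1)
    total = sum(char_value(c) * weights[i % 3] for i, c in enumerate(field))
    return total % 10


# Fields from the ICAO 9303 specimen passport: the document number
# "L898902C3" is followed by check digit 6, and the birth date "740812" by 2.
print(check_digit("L898902C3"))  # 6
print(check_digit("740812"))     # 2
```

A field whose recognized check digit does not match the computed one most likely contains a misread character, which makes this a cheap sanity check on recognized MRZ text.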

Test Images: Dim Lighting, Office Desk, Outdoors, Interference, Highly Skewed

MRZ Scanner Parameters: Document Alignment, Center Crop, Post-Processing

Testimonials

"Thank you for sharing your programming knowledge with such passion and clarity."

Alexis

"Thank you very much for the articles you have shared."

cactusgame

"A very practical feature."

frankelinli