DOCSAID

A playground for our developers.

Open Source Projects

Explore our open-source projects and drive AI innovation with us.

Technical Documentation

We share insights to clarify our vision and enhance development.

Research Analysis

We share notes to reveal our thoughts from research papers.

Featured Projects

Capybara

Image processing, format conversion, and search tools powered by OpenCV.

WordCanvas

Text image rendering tool based on Pillow for generating random text images.

DocAligner

Detects documents in images and precisely locates four corner points.

DocClassifier

Image classification system integrating contrastive learning and PartialFC.

MRZScanner

Detects and extracts MRZ text regions with 99.97% text similarity.

Chameleon

A deep learning toolbox based on PyTorch, integrating Timm and custom training modules.

AutoTraderX

Automated trading, order placement, and quotes via Taiwan securities API.

GmailSummary

Automates email summarization and organization with Gmail API and OpenAI API.

🤝 Model-Centric Consulting & Services

A small model-focused studio. We turn real needs into maintainable, deployable, and evolvable model modules, working embedded with your team. Frontend and backend work is kept lightweight and serves only to evaluate, showcase, and integrate models.

🧩 Module Dev & Maintenance

Productize one model module and maintain it long-term with versioned data & benchmarks

🗓️ Consulting

1–2 days/week embedded collaboration: experiment design, data governance, evaluation

🚀 MVP from Zero

Build a minimal, demonstrable product around the model: selection, API, lightweight UI

⚡ Inference Optimization & Deployment

ONNX/TensorRT, quantization, latency budgeting; SDK/REST, batch/stream, private deploy

📚 For full service details, deliverables, and budget ranges, please visit: Full Service Overview

⚠️ Important Notes
  • 🧠 Model-first scope: FE/BE are lightweight for evaluation/showcase/integration only
  • 🗓️ Ongoing work often uses a monthly retainer; final quotes follow scoping
  • 💵 Billing in TWD (Taiwan)
  • 🌍 Cross-timezone/international work is welcome; please schedule in advance
  • ❌ We don’t offer LLM self-training (consultation and system evaluation are OK)
  • 📜 NDA supported; change logs and rollback strategy provided

Cooperation Form

Please fill in the following information. I will reply within 1 to 2 business days.

We can maintain multiple projects concurrently; pricing is per project per month.
Starts at TWD 100,000 per project per month; final quote after scoping (retainer is common for ongoing work).

Features

Below are demos of two model modules: DocAligner for document alignment and MRZScanner for MRZ recognition.

DocAligner Demo

Upload an image containing a document to detect key points and perform perspective correction.
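
To make the perspective-correction step concrete, here is a minimal pure-Python sketch of how four detected corner points can be mapped onto an upright rectangle. It is not DocAligner's API: the function names (`solve_linear`, `homography`, `warp_point`) and the corner coordinates are hypothetical and chosen only for illustration. It estimates the 3x3 homography from four point correspondences, which is the transform a perspective warp applies to every pixel.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def solve_linear(a: List[List[float]], b: List[float]) -> List[float]:
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Build the augmented matrix [a | b] so row operations update both sides.
    m = [list(row) + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("degenerate corner configuration")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    # Back substitution.
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def homography(src: Sequence[Point], dst: Sequence[Point]) -> List[List[float]]:
    """Estimate the 3x3 homography mapping four src points to four dst points."""
    a: List[List[float]] = []
    b: List[float] = []
    for (x, y), (u, v) in zip(src, dst):
        # u = (h0*x + h1*y + h2) / (h6*x + h7*y + 1), rearranged to be linear in h.
        a.append([x, y, 1.0, 0.0, 0.0, 0.0, -u * x, -u * y])
        b.append(u)
        # v = (h3*x + h4*y + h5) / (h6*x + h7*y + 1), rearranged the same way.
        a.append([0.0, 0.0, 0.0, x, y, 1.0, -v * x, -v * y])
        b.append(v)
    h = solve_linear(a, b)
    return [h[0:3], h[3:6], [h[6], h[7], 1.0]]


def warp_point(h: List[List[float]], p: Point) -> Point:
    """Apply the homography to a single point."""
    x, y = p
    w = h[2][0] * x + h[2][1] * y + h[2][2]
    return (
        (h[0][0] * x + h[0][1] * y + h[0][2]) / w,
        (h[1][0] * x + h[1][1] * y + h[1][2]) / w,
    )


# Hypothetical corners in the order top-left, top-right, bottom-right, bottom-left.
corners = [(120.0, 80.0), (520.0, 110.0), (560.0, 400.0), (90.0, 370.0)]
# Target rectangle: a 400 x 300 upright document.
target = [(0.0, 0.0), (400.0, 0.0), (400.0, 300.0), (0.0, 300.0)]

H = homography(corners, target)
for corner in corners:
    print(corner, "->", warp_point(H, corner))
```

In practice the same transform is usually obtained with OpenCV's `cv2.getPerspectiveTransform` and applied to the whole image with `cv2.warpPerspective`; the pure-Python version above only shows the underlying arithmetic.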

Test Images: Text Interference, Partial Occlusion, Strong Reflection, Low Light Scene, Highly Skewed

MRZScanner Demo

Upload an image with MRZ (Machine Readable Zone) to detect and parse the text content.
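
One common step when parsing MRZ text is validating its check digits. The sketch below is an illustration of the check-digit rule defined in ICAO Doc 9303 (weights 7, 3, 1 repeating; digits keep their value, letters A to Z map to 10 to 35, and the filler `<` counts as 0). It is not MRZScanner's implementation, and the function names are hypothetical.

```python
WEIGHTS = (7, 3, 1)


def mrz_char_value(ch: str) -> int:
    """Numeric value of one MRZ character under the ICAO 9303 rule."""
    if "0" <= ch <= "9":
        return ord(ch) - ord("0")
    if "A" <= ch <= "Z":
        return ord(ch) - ord("A") + 10
    if ch == "<":
        return 0
    raise ValueError(f"invalid MRZ character: {ch!r}")


def mrz_check_digit(field: str) -> int:
    """Weighted sum of the field's characters, modulo 10."""
    total = sum(
        mrz_char_value(ch) * WEIGHTS[i % 3] for i, ch in enumerate(field)
    )
    return total % 10


def mrz_field_is_valid(field: str, check_char: str) -> bool:
    """Compare the computed check digit with the one read from the MRZ."""
    return check_char.isdigit() and mrz_check_digit(field) == int(check_char)


# Fields from the ICAO specimen passport.
print(mrz_check_digit("L898902C3"))  # document number -> 6
print(mrz_check_digit("740812"))     # date of birth   -> 2
print(mrz_check_digit("120415"))     # date of expiry  -> 9
```

A mismatch between the computed and the recognized check digit is a useful signal that a character was misread, for example `O` confused with `0`.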

Test Images: Dim Lighting, Office Desk, Outdoors, Interference, Highly Skewed

MRZ Scanner Parameters: Document Alignment, Center Crop, Post-Processing

Testimonials

Alexis

"Thank you for sharing your programming knowledge with such passion and clarity."

cactusgame

"Thank you very much for the articles you share."

frankelinli

"A very practical feature."