Skip to main content

Worklog


info

This page is solely for recording our work activities.


2024

November

  • Updated @docusaurus/core@3.6.1 and found out it's not backward compatible...
    • Spent time updating the problematic code sections.
  • Wrote paper reviews, totaling 135 papers.
  • DocumentAI: Continued development.
  • TextRecognizer: Continued development from October.

October

  • Completed the model demo functionality and deployed it on the website: Playground
  • Moved NextCloud from our own host to GCP and updated all download links.

September

  • MRZScanner: Project completed and made open-source. 🎉 🎉 🎉
  • TextDetector: Continued development, following progress made in March.
  • Came across a beautifully designed website and had to note it down:
  • Wrote paper reviews, totaling 100 papers.

August

  • MRZScanner: Deployment testing and rework.
  • Updated @docusaurus/core@3.5.2 and found out it's not backward compatible...
    • Spent time updating the problematic code sections.
  • Investigated OpenCV dependency issues and discovered we weren’t alone:
  • Wrote paper reviews, with 90 papers reviewed so far.

July

  • Wrote paper reviews, with around 80 papers in total so far.
  • MRZScanner: Began development.

June

  • AutoTraderX: Completed the API integration for Yuanta Securities and made it open-source. 🎉 🎉 🎉
  • Ran out of funds for OpenAI services, so we suspended the daily news push from GmailSummary.
  • Continued writing paper reviews, totaling 50 papers by this point.

May

  • Finished developing the Text Recognizer model.
    • Final evaluation results were promising, but we think it’s still an "overfitted model pretending not to be overfitted." (???)
    • Since it doesn’t meet our ideal standards yet, we’ve decided not to release it for now.
  • Explored Docusaurus' Search feature, tested and integrated Algolia search service.
  • Continued working on the Text Recognizer model, adjusting parameters and training.
  • AutoTraderX: Development started.

April

  • Learned how to configure CSS styles to tweak the blog’s appearance.
  • TextRecognizer: Continued development from WordCanvas and made further progress on the text recognition project.
  • GmailSummary: Modified functionality to push daily news to the tech documentation page.
  • Completed technical documentation for all ongoing projects.
  • Explored Docusaurus’ i18n functionality and started writing English documentation.
  • Investigated Docusaurus’ documentation features and began migrating content from GitHub to the platform.
  • WordCanvas: Project completed and made open-source. 🎉 🎉 🎉

March

One day, we found that the Google Drive download feature broke—what was once accessible through gen_download_cmd became a garbled mess of HTML. 👻 👻 👻

After considering several options...

We decided to use NextCloud to set up a private cloud for storing data and updated our previous download links accordingly.

  • GmailSummary: Completed development and made it open-source. 🎉 🎉 🎉
  • DocClassifier: Discovered that stacking multiple normalization layers significantly improved model performance (a surprising discovery...).
  • TextRecognizer: Early-stage project planning.
  • WordCanvas: Development started.
  • TextDetector: Ran into several issues and decided to put it on hold for now.

February

January

  • TextDetector: Early-stage project planning.
  • DocClassifier: Project completed and made open-source. 🎉 🎉 🎉

2023

December

  • DocClassifier: Development started.
  • DocAligner: Completed development and made it open-source. 🎉 🎉 🎉
  • Website: Discovered Meta’s interesting open-source project Docusaurus. It provides a simple way to build a static website using Markdown for content creation, so I decided to use it to write a blog.
  • Abandoned and deleted the WordPress-built website, migrating all content to the GitHub website project.

November

  • DocClassifier: Early-stage project planning.
  • DocsaidKit: Completed development and made it open-source. 🎉 🎉 🎉
  • Wrote paper reviews, totaling 20 papers.

October

  • WordCanvas: Early-stage project planning.
  • DocGenerator: Completed phase two of development, splitting the text synthesis module into the WordCanvas project.

September

  • DocAligner: Development started.
  • DocGenerator: Phase one of development completed.
  • Wrote paper reviews, totaling 5 papers.

August

  • DocAligner: Early-stage project planning.
  • DocsaidKit: Organized commonly used tools and started development.
  • Explored WordPress functionality, experimented with building a personal blog.
  • Created a DOCSAID GitHub account and started planning various projects.