Skip to main content

Worklog


info

This page is solely for recording our work activities.


2024

November

  • Added i18n: 日本語
  • Updated @docusaurus/core@3.6.1 and found out it's not backward compatible...
    • Spent time updating the problematic code sections.
  • Wrote paper reviews, totaling 135 papers.
  • DocumentAI: Continued development.
  • TextRecognizer: Continued development from October.

October

  • Completed the model demo functionality and deployed it on the website: Playground
  • Moved NextCloud from our own host to GCP and updated all download links.

September

  • MRZScanner: Project completed and made open-source. 🎉 🎉 🎉
  • TextDetector: Continued development, following progress made in March.
  • Came across a beautifully designed website and had to note it down:
  • Wrote paper reviews, totaling 100 papers.

August

  • MRZScanner: Deployment testing and rework.
  • Updated @docusaurus/core@3.5.2 and found out it's not backward compatible...
    • Spent time updating the problematic code sections.
  • Investigated OpenCV dependency issues and discovered we weren’t alone:
  • Wrote paper reviews, with 90 papers reviewed so far.

July

  • Wrote paper reviews, with around 80 papers in total so far.
  • MRZScanner: Began development.

June

  • AutoTraderX: Completed the API integration for Yuanta Securities and made it open-source. 🎉 🎉 🎉
  • Ran out of funds for OpenAI services, so we suspended the daily news push from GmailSummary.
  • Continued writing paper reviews, totaling 50 papers by this point.

May

  • Finished developing the Text Recognizer model.
    • Final evaluation results were promising, but we think it’s still an "overfitted model pretending not to be overfitted." (???)
    • Since it doesn’t meet our ideal standards yet, we’ve decided not to release it for now.
  • Explored Docusaurus' Search feature, tested and integrated Algolia search service.
  • Continued working on the Text Recognizer model, adjusting parameters and training.
  • AutoTraderX: Development started.

April

  • Learned how to configure CSS styles to tweak the blog’s appearance.
  • TextRecognizer: Continued development from WordCanvas and made further progress on the text recognition project.
  • GmailSummary: Modified functionality to push daily news to the tech documentation page.
  • Completed technical documentation for all ongoing projects.
  • Explored Docusaurus’ i18n functionality and started writing English documentation.
  • Investigated Docusaurus’ documentation features and began migrating content from GitHub to the platform.
  • WordCanvas: Project completed and made open-source. 🎉 🎉 🎉

March

One day, we found that the Google Drive download feature broke—what was once accessible through gen_download_cmd became a garbled mess of HTML. 👻 👻 👻

After considering several options...

We decided to use NextCloud to set up a private cloud for storing data and updated our previous download links accordingly.

  • GmailSummary: Completed development and made it open-source. 🎉 🎉 🎉
  • DocClassifier: Discovered that stacking multiple normalization layers significantly improved model performance (a surprising discovery...).
  • TextRecognizer: Early-stage project planning.
  • WordCanvas: Development started.
  • TextDetector: Ran into several issues and decided to put it on hold for now.

February

January

  • TextDetector: Early-stage project planning.
  • DocClassifier: Project completed and made open-source. 🎉 🎉 🎉

2023

December

  • DocClassifier: Development started.
  • DocAligner: Completed development and made it open-source. 🎉 🎉 🎉
  • Website: Discovered Meta’s interesting open-source project Docusaurus. It provides a simple way to build a static website using Markdown for content creation, so I decided to use it to write a blog.
  • Abandoned and deleted the WordPress-built website, migrating all content to the GitHub website project.

November

  • DocClassifier: Early-stage project planning.
  • DocsaidKit: Completed development and made it open-source. 🎉 🎉 🎉
  • Wrote paper reviews, totaling 20 papers.

October

  • WordCanvas: Early-stage project planning.
  • DocGenerator: Completed phase two of development, splitting the text synthesis module into the WordCanvas project.

September

  • DocAligner: Development started.
  • DocGenerator: Phase one of development completed.
  • Wrote paper reviews, totaling 5 papers.

August

  • DocAligner: Early-stage project planning.
  • DocsaidKit: Organized commonly used tools and started development.
  • Explored WordPress functionality, experimented with building a personal blog.
  • Created a DOCSAID GitHub account and started planning various projects.

Before This

We drifted between various jobs, day after day, year after year. Listening to different bosses tell the same dreams, chewing on those tasteless promises.

Countless projects rushed to completion through sleepless nights, intertwining passionate ideals, only to sway between the capital market’s affection and indifference.

When the love fades, it all falls apart.

Before our youth completely slips away, we still want to leave something behind.

Anything will do—just to mark that we were here.