Skip to content

docs: add GSSoC OCR engine preprocessing reference guide#1716

Merged
ritesh-1918 merged 5 commits into
ritesh-1918:gssocfrom
zeroknowledge0x:docs/gssoci-ocr-preprocessing
Jun 5, 2026
Merged

docs: add GSSoC OCR engine preprocessing reference guide#1716
ritesh-1918 merged 5 commits into
ritesh-1918:gssocfrom
zeroknowledge0x:docs/gssoci-ocr-preprocessing

Conversation

@zeroknowledge0x

Copy link
Copy Markdown

Summary

Add a comprehensive GSSoC contributor reference guide covering OCR image preprocessing — specifically binarization and rotation correction — to improve text recognition accuracy on ticket images.

Changes

  • Add with:
    • Otsu's automatic binarization (with code)
    • Adaptive Gaussian thresholding (with code)
    • Hough line-based rotation correction (with code)
    • Moment-based deskewing (faster alternative)
    • Full preprocessing pipeline combining all steps
    • Integration notes for the HELPDESK.AI AI API
    • Common pitfalls table with fixes

Testing

  • All Python code snippets are syntactically valid and follow existing HELPDESK.AI conventions
  • Format matches existing handoff reference docs (markdown tables, code blocks, sections)

Related Issues

Fixes #1710

…CUTS_LEGEND field names (combo/description)

Fixes ritesh-1918#1409 - Interactive Keyboard Shortcuts for Admin Dashboard

- Return showHelp/setShowHelp from hook (enables external control of shortcuts modal)
- Fix SHORTCUTS_LEGEND to use {combo, description} fields matching Help.jsx consumer
- Enables AdminLayout to integrate hook-managed shortcuts modal state
- Add MD007, MD009, MD012, MD022, MD026, MD030, MD031, MD032, MD034,
  MD036, MD040, MD055, MD058, MD060 to markdownlint config
- Fix langdetect>=1.2.0 -> langdetect>=1.0.9 (1.2.0 doesn't exist on PyPI)
- Resolves CI failures on PR ritesh-1918#1415
Fixes ritesh-1918#1710

- Add binarization guide (Otsu's + adaptive thresholding)
- Add rotation correction (Hough line + moment-based deskewing)
- Full preprocessing pipeline with code examples
- Integration notes for HELPDESK.AI AI API
- Common pitfalls table
@vercel

vercel Bot commented Jun 4, 2026

Copy link
Copy Markdown

@zeroknowledge0x is attempting to deploy a commit to the ritesh Team on Vercel.

A member of the Team first needs to authorize it.

@coderabbitai

coderabbitai Bot commented Jun 4, 2026

Copy link
Copy Markdown

Important

Review skipped

Auto reviews are disabled on base/target branches other than the default branch.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 2241259b-f960-4a85-89f7-093d6e1834d0

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

  • 🔍 Trigger review
✨ Finishing Touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

@ritesh-1918 ritesh-1918 added gssoc GirlScript Summer of Code gssoc:approved GSSoC Approved PR mentor:ritesh-1918 Reviewed by Project Admin Ritesh level:intermediate Intermediate level difficulty quality:clean Clean code quality type:docs Documentation additions or updates labels Jun 4, 2026
@ritesh-1918

Copy link
Copy Markdown
Owner

Hi @zeroknowledge0x! 🙌

Thank you so much for your excellent contribution: "docs: add GSSoC OCR engine preprocessing reference guide"! We really appreciate the high-quality code and effort you have put into the platform.

Just a quick, friendly heads-up as we prepare our manual merging and verification queues—please make sure to complete all the mandatory community steps listed below.

⚠️ Quick leaderboard tip: To secure the absolute highest GSSoC point tier for your contribution, please make sure you are following our project admin @ritesh-1918 manually on GitHub. Having all four onboarding steps cleared is highly preferable for manual PR approvals! ✨

Once those manual steps are verified, we'll get your PR officially merged into the gssoc branch (or keep it neatly cataloged if closed as integrated) and assign it the highest possible GSSoC S-Tier labels to maximize your leaderboard points!

Let's build something amazing together! 🚀🔥


🌟 Project Support & Developer Network (Show Some Love!)

As we prepare our manual verification and merging queues, please take a quick moment to ensure you have completed all four community steps:

  1. Star this repository: Helps our AI helpdesk get noticed! Star the repo here
  2. 🍴 Fork this repository: Keep a copy to build your own cool tools! Fork here
  3. 👤 Follow @ritesh-1918 on GitHub: Stay updated on real-time open-source projects! Follow ritesh-1918 here
  4. 💼 Connect on LinkedIn: Let's build a strong engineering connection! Connect with Ritesh on LinkedIn

Note: Having all four steps completed manually is required before your PR points are officially cleared.

@ritesh-1918 ritesh-1918 added the level:advanced Advanced level difficulty label Jun 5, 2026
@ritesh-1918

Copy link
Copy Markdown
Owner

Hi @zeroknowledge0x! 🙌

Thank you so much for your excellent contribution: "docs: add GSSoC OCR engine preprocessing reference guide"! We really appreciate the high-quality code and effort you have put into the platform.

Just a quick, friendly heads-up as we prepare our manual merging and verification queues—please make sure to complete all the mandatory community steps listed below.

⚠️ Quick leaderboard tip: To secure the absolute highest GSSoC point tier for your contribution, please make sure you are following our project admin @ritesh-1918 manually on GitHub. Having all four onboarding steps cleared is highly preferable for manual PR approvals! ✨

Once those manual steps are verified, we'll get your PR officially merged into the gssoc branch (or keep it neatly cataloged if closed as integrated) and assign it the highest possible GSSoC S-Tier labels to maximize your leaderboard points!

Let's build something amazing together! 🚀🔥


🌟 Project Support & Developer Network (Show Some Love!)

As we prepare our manual verification and merging queues, please take a quick moment to ensure you have completed all four community steps:

  1. Star this repository: Helps our AI helpdesk get noticed! Star the repo here
  2. 🍴 Fork this repository: Keep a copy to build your own cool tools! Fork here
  3. 👤 Follow @ritesh-1918 on GitHub: Stay updated on real-time open-source projects! Follow ritesh-1918 here
  4. 💼 Connect on LinkedIn: Let's build a strong engineering connection! Connect with Ritesh on LinkedIn

Note: Having all four steps completed manually is required before your PR points are officially cleared.

@ritesh-1918 ritesh-1918 merged commit 9e2a3e2 into ritesh-1918:gssoc Jun 5, 2026
3 of 6 checks passed
@ritesh-1918

Copy link
Copy Markdown
Owner

Merged locally into gssoc branch. Thank you for your GSSoC contribution! 🚀🔥

@ritesh-1918 ritesh-1918 removed the mentor:ritesh-1918 Reviewed by Project Admin Ritesh label Jun 6, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

gssoc:approved GSSoC Approved PR gssoc GirlScript Summer of Code level:advanced Advanced level difficulty level:intermediate Intermediate level difficulty quality:clean Clean code quality type:docs Documentation additions or updates

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants