Case insensitive #675

jesusbft · 2025-01-05T03:33:01Z

Sorry, I have to fix my branch and it delete my older pull request.

Summary: I propose adding an optional feature to calculate a case-insensitive Character Error Rate (CER) for the Kraken OCR test function. This metric would ignore errors caused by uppercase and lowercase differences in the predictions.

Use Case: This feature could be useful for:

OCR projects involving historical texts, where capitalization is often inconsistent.
Scenarios where case differences are less important than the overall recognition accuracy.

How to use:
ketos test (parameters)

The report will show the CER case-insensitive

jesusbft added 3 commits December 21, 2024 12:51

Added random sampling option for testing dataset in Kraken OCR

fb5543e

Include CER Case Insensitive metric in the test report

6596af4

sample percentage

ba8b6b0

mittagessen merged commit 8df97ff into mittagessen:main Feb 9, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Case insensitive #675

Case insensitive #675

jesusbft commented Jan 5, 2025

Case insensitive #675

Case insensitive #675

Conversation

jesusbft commented Jan 5, 2025