You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Problem description:
when extract text in Chinese, the result will contain space, then you can't search it in omnisearch(unless you add space manually)
here is a image I test
the result in cache is
{"path":".obsidian/plugins/text-extractor/cache/802469d99ce6f82bfe4e7c007322d468.json","text":"万 东 立 刻 来 了 思 路 写 起 了 大 纲 , 大 致 剧 情 如 下 。 【 女 主 被 匪 徒 绑 架 , 男 主 为 救 女 主 被 匪 徒 击 中 要 害 丧 失 了 生 育 能 力 。 女 主 知 道 后 十 分 的 愧 疚 , 便 把 自 己 的 子 宫 移 植 给 了 男 主","libVersion":"0.2.2","langs":"chi_sim+eng"}
as you can see, it contains space between characters, which cause omnisearch stop working
until I add space munally, then I can get ocr result
Your environment:
Plugin version: 0.4.6
Obsidian version: 1.4.2
Operating system: 14.0
Number of images/PDFs in your vault (approx.): very small
Other plugins that may be related to the issue:
omnisearch
The text was updated successfully, but these errors were encountered:
Problem description:
![image](https://private-user-images.githubusercontent.com/31941670/258851333-d86c1ceb-44dc-456e-a8f8-9f5195f0c715.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk0NTg4ODQsIm5iZiI6MTczOTQ1ODU4NCwicGF0aCI6Ii8zMTk0MTY3MC8yNTg4NTEzMzMtZDg2YzFjZWItNDRkYy00NTZlLWE4ZjgtOWY1MTk1ZjBjNzE1LnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTMlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjEzVDE0NTYyNFomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTA5MjkyZGIwMDMxMGIzZDAxOWVhZDQ4Nzg5MTQ2ZjE5NjI0Nzc5YTNkODNmNTFjZGYxOTI3NWM0YTY4NTg1ZjEmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.9wJkFpZi9DRRfMLMFmETTYI7dGv8csrFU0T1SQ8QjOA)
![image](https://private-user-images.githubusercontent.com/31941670/258851572-7e4cb493-eaee-41eb-9813-ecb98efc4561.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk0NTg4ODQsIm5iZiI6MTczOTQ1ODU4NCwicGF0aCI6Ii8zMTk0MTY3MC8yNTg4NTE1NzItN2U0Y2I0OTMtZWFlZS00MWViLTk4MTMtZWNiOThlZmM0NTYxLnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTMlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjEzVDE0NTYyNFomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPThkNGQ4YWNjZjlkN2NmM2JiYjRmYjJiZTBiNDUxOGMwMTRiNzNiMDMxYzRjZjk0NzNkYmU0MDY0ZGIwNWUzNWImWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.P8Q9SoB0AquUkLXN2K8r7ckkqWIPRwp-5ppMe4V7tWw)
![image](https://private-user-images.githubusercontent.com/31941670/258851740-9f668007-e562-404c-bd63-9f63f7515536.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk0NTg4ODQsIm5iZiI6MTczOTQ1ODU4NCwicGF0aCI6Ii8zMTk0MTY3MC8yNTg4NTE3NDAtOWY2NjgwMDctZTU2Mi00MDRjLWJkNjMtOWY2M2Y3NTE1NTM2LnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTMlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjEzVDE0NTYyNFomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTkzZTBjMmE4MjUzZDI2ZDJmZDdmOWZmMDY4ZDExYjBhNGM2MDNlYzQ0ZDUxNjI4YjFlYTViNzE0NjRkNWZhNGMmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.aNNaWynu_6z8QRdU27h2xjgYboafkSQQ8gSB0gG_PzE)
when extract text in Chinese, the result will contain space, then you can't search it in omnisearch(unless you add space manually)
here is a image I test
the result in cache is
{"path":".obsidian/plugins/text-extractor/cache/802469d99ce6f82bfe4e7c007322d468.json","text":"万 东 立 刻 来 了 思 路 写 起 了 大 纲 , 大 致 剧 情 如 下 。 【 女 主 被 匪 徒 绑 架 , 男 主 为 救 女 主 被 匪 徒 击 中 要 害 丧 失 了 生 育 能 力 。 女 主 知 道 后 十 分 的 愧 疚 , 便 把 自 己 的 子 宫 移 植 给 了 男 主","libVersion":"0.2.2","langs":"chi_sim+eng"}
as you can see, it contains space between characters, which cause omnisearch stop working
until I add space munally, then I can get ocr result
Your environment:
omnisearch
The text was updated successfully, but these errors were encountered: