All text and numbers in figures and tables disappear after PDF conversion #635
Comments
The new backend also has this issue. I will analyze it when there is time.
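After translating with 1.9.1 the output is completely blank, while 1.8.8 still works. I suspect the characters are not displayed because of a font problem.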
File "I:\pdf_en2cn\pdf2zhcmd.venv\Lib\site-packages\pymupdf\utils.py", line 5698, in build_subset
@duofengzhiling See #678; please wait for the official release. That option fixes the font subsetting problem.
Before asking...
Environment
Describe your problem
How to reproduce
Run
pdf2zh ~/Downloads/New_USM_1_30Jul24_1.pdf -s google -li en -lo zh
Expected behavior
No response
Relevant logs
Original PDF file
New_USM_1_30Jul24_1.pdf
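Anything else?
Notes
After I downgraded numpy to 1.26.4, the message below appears. The error reported when running pdf2zh is the one pasted in the logs section above.
If I upgrade to numpy-2.2.3 and run pdf2zh again, the log error below appears and the output PDF is also blank.
Incorrectly converted documents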
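Since the behavior here seems to depend on which versions of pdf2zh and numpy are installed, it may help to attach the exact versions from the virtual environment that produced the blank PDF. The following is only a minimal sketch using the standard library; the helper name `installed_versions` is made up for illustration, and the distribution names `pdf2zh`, `numpy`, and `PyMuPDF` are assumptions about how the packages were installed in this environment.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_versions(packages):
    """Return {distribution name: installed version, or None if not installed}."""
    result = {}
    for name in packages:
        try:
            result[name] = version(name)
        except PackageNotFoundError:
            # The distribution is not installed under this name.
            result[name] = None
    return result


if __name__ == "__main__":
    # Distribution names are assumptions; adjust them to match your install.
    for name, ver in installed_versions(["pdf2zh", "numpy", "PyMuPDF"]).items():
        print(f"{name}: {ver or 'not installed'}")
```

Running this once in the working setup (for example with 1.8.8) and once in the failing one gives two version lists that can be compared side by side in the report.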
New_USM_1_30Jul24_1-dual.pdf
New_USM_1_30Jul24_1-mono.pdf