Garbled text after translation #691

Open
4 tasks done
IgnoranceSmile opened this issue Feb 26, 2025 · 4 comments
Labels
bug Something isn't working

Comments

@IgnoranceSmile

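Before asking...

  • I have searched the existing issues
  • I spent at least 5 minutes thinking and preparing before asking
  • I have read the wiki carefully and in full
  • I have carefully checked that the problem is unrelated to the network environment (including but not limited to Google being unavailable or model download failures)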

Environment

- OS: Ubuntu
- Python: 3.10
- pdf2zh: 1.9.1

Describe your problem

For some PDFs, the layout is scrambled after translation.

How to reproduce

No response

Expected behavior

No response

Relevant logs


Original PDF file

1.pdf

Anything else?

No response

@IgnoranceSmile IgnoranceSmile added the bug Something isn't working label Feb 26, 2025
@Liangjy686

I ran into the same problem. The preview in the web page looks fine, but the downloaded file is garbled.

@hellofinch
Contributor

Image
Is this the part you mean?

@IgnoranceSmile
Author

The whole document is somewhat off. On my side the problem shows up both in the preview and in the downloaded file.

Image

@hellofinch
Contributor

hellofinch commented Feb 27, 2025

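On my side only part of the document has problems, so this may be caused by the translation service you selected.
If you are using an LLM as the service, the model may not adequately understand the default prompt passed to it. You could try adjusting the prompt to suit your needs and the service you use.
This is the output from my test.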
1-mono.pdf
