-
-
Notifications
You must be signed in to change notification settings - Fork 701
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
翻译扫描档存在重影 / feat (main): supports ocr on scanned document #19
Comments
图片型的 PDF 文档暂时还没办法翻译,目前主要还是在优化电子书和论文的翻译效果 |
好的,非常感谢 |
均为图像有点为难人了,ocr的质量 影响文字的质量 影响翻译的效果 |
加一个可选流程paddleOCR, |
sayura |
和 minerU/marker 比较怎么样呀 |
sayura 就是 marker 的作者做的开源多国语言和表格的 OCR 模型😂 |
只有一段OCR的内容, 实在是看不懂怎么把OCR出来的结果往后传了。 |
当pdf文件均为图像,而不是可编辑(复制)状态时,翻译完全失败,具体见图
The text was updated successfully, but these errors were encountered: