Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

扫描件检测&输出警告 #264

Closed
awwaawwa opened this issue Dec 18, 2024 · 5 comments
Closed

扫描件检测&输出警告 #264

awwaawwa opened this issue Dec 18, 2024 · 5 comments
Labels
enhancement New feature or request

Comments

@awwaawwa
Copy link
Contributor

既然目前处理扫描件存在许多技术难题,那搞个检测&输出个暂时无法处理的提示?

@hellofinch
Copy link
Contributor

其实应该是可以弄的,现在的模型可以提取出不同类型文本的位置,然后ocr也是可以复用这个框的。
我之前没搞定的地方在于怎么生成一个新的pdf页。

@awwaawwa
Copy link
Contributor Author

awwaawwa commented Dec 18, 2024

这样啊,哪天有空了我试试。

@hellofinch
Copy link
Contributor

OCR我之前甩了一段在那个help wanted issue里,可以直接拿去用。
现在的这个pdf的解析我没看明白怎么做的,但是注意一下box的框框和ocr用的框框不太一样。box的框框我记得会小一些,不能用box的框框给ocr,会出错。

@Byaidu Byaidu added the enhancement New feature or request label Dec 18, 2024
@reycn
Copy link
Collaborator

reycn commented Dec 18, 2024

合并讨论至 Issue#19

@reycn reycn closed this as completed Dec 18, 2024
@Issues-translate-bot
Copy link

The issue has been automatically translated into English.


Merged discussion into Issue#19

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

5 participants