-
Notifications
You must be signed in to change notification settings - Fork 447
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Duplicate words in OCR result #330
Comments
I think using some overlap detection postprocessing it's possible to filter out those duplicates. |
Thanks for reporting this! I'm not sure which way would be the best, but here are some ideas to handle this:
The first option being natively implemented in most modern DL frameworks, it might be a suitable option to try first |
As a matter of fact, we do want to suppress very small boxes included in other ones, so I suggest the following:
This overlapping seems to be mostly frequent with underscores, so I think it is a good approximation to merge boxes in that case (technically, it is the same word). What do you think @fg-mindee ? |
@charlesmindee Thanks for the suggestion! |
Coming back to this issue, I suggest the following:
But let's not leave this issue unaddressed 😃 |
@frgfm @charlesmindee @odulcy-mindee Seems to be solved with
|
Hi @felixdittrich92, thanks for the suggestion, I think we can change the default behaviour since it is quite natural to preserve the aspect ratio by default. Moreover, it will make the predictions robuster to cropping. |
🐛 Bug
Running the sample code:
I get this result:
Everything looks fine but there is some overlap between different words. The mouse is pointing to the word "Header4" and there is another word with the content "4". In that case I'm not able to reconstruct properly the table header as there is either an extra "4".
To Reproduce
Steps to reproduce the behavior:
The text was updated successfully, but these errors were encountered: