Misbehaviour of dimensions for some warped images #17

StaelTchinda · 2023-11-18T18:00:44Z

Hi,

first I want to thank you for your work.

The algorithm works fine for images, where all the document can be seen and is flat.
However, I have a case of a document, where the dimensions of the image are incorrectly computed. (Look part of the verbose below.) Maybe more constraints on the dimensions or corners or coordinates computations are required.

Loaded 67c656c099c941ae759.jpeg at size='1800x1013' --> resized='900x506'
  got 3 spans with 17 points.
  initial objective is 0.00017673946556242466
  optimizing 28 parameters...
  optimization took 0.21 sec.
  final objective is 7.04562825312913e-05
  got page dims 811571190.8768755 x 1.1532271561006338
  output will be 416613624176x592

I am not very familiar with how the code works, but if you know how the problem may be solved, you could quickly explain to me so that I implement it and create a pull request.

Best regards,

The text was updated successfully, but these errors were encountered:

SpicyCatGames · 2024-02-24T06:21:18Z

Having the document would be very helpful. Please provide it if it's something you can share.

joguy56 · 2024-04-09T12:43:00Z

I encountered the same issue on pages that are near to be blank pages for example title pages where there is only one line, one block of text.

I observed that if the page contains multiple lines of text, it is fine.

lmmx · 2024-09-14T23:00:55Z

To reiterate what @SpicyCatGames has said, thank you @StaelTchinda @joguy56 for the bugs! I am working on upgrading my triage process to drive them to completion, it would be super if anyone has a reproducible demo image they can share, this has proven key to solving issues in the past (I know how it works and my intuition can still be off!).

On that note there is a ~~blog post~~ repository wiki explaining how it works and I've made a note to increase the prominence and clarity of docs in forthcoming releases.

Edit it was a wiki I made notes in here, I recall it being quite extensively detailed (probably too detailed for most users but interesting for anyone curious about the inner working), you can find it at https://github.com/lmmx/page-dewarp/wiki

It was the original author Matt Zucker who wrote the blog post, which you can find here: https://mzucker.github.io/2016/08/15/page-dewarping.html

I'm triaging right now but I think a good way to debug this could be to simply attach the repo wiki docs and the source code files in a Claude Project and see what the LLM thinks 😸 I will also give it some good old-fashioned human investigation haha

lmmx · 2024-09-14T23:06:23Z

I encountered the same issue on pages that are near to be blank pages for example title pages where there is only one line, one block of text.

I observed that if the page contains multiple lines of text, it is fine.

This seems like a hint yes, so the algorithm works by finding "line contours" and then using these to find the orientation of the page as input to the "dewarping" algorithm.

To see the intermediate states, I highly recommend running with the debug flag (-d) set to its top value of 3, which will produce them for manual inspection. This was used last week to get to the bottom of an issue with poor results (due to default page margin removing valid lines, which can be fixed by lowering the value of the page margin).

I recently reviewed the code here and as a few years have gone by, I can now more easily see how to simplify it, and I'll be scheduling these upgrades in the coming months.

lmmx self-assigned this Sep 8, 2024

lmmx added this to Planner Sep 8, 2024

lmmx added the bug Something isn't working label Sep 8, 2024

lmmx mentioned this issue Sep 8, 2024

Document how the entire program works, highlight and/or update existing blog post #34

Open

lmmx added this to Page Dewarp Release Planner Sep 8, 2024

lmmx moved this to 🔮 Future in Page Dewarp Release Planner Sep 8, 2024

lmmx moved this to 🐣 Hatching in Planner Sep 10, 2024

lmmx moved this from 🐣 Hatching to 🔙 Backlog in Planner Sep 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Misbehaviour of dimensions for some warped images #17

Misbehaviour of dimensions for some warped images #17

StaelTchinda commented Nov 18, 2023

SpicyCatGames commented Feb 24, 2024

joguy56 commented Apr 9, 2024

lmmx commented Sep 14, 2024 •

edited

Loading

lmmx commented Sep 14, 2024

Misbehaviour of dimensions for some warped images #17

Misbehaviour of dimensions for some warped images #17

Comments

StaelTchinda commented Nov 18, 2023

SpicyCatGames commented Feb 24, 2024

joguy56 commented Apr 9, 2024

lmmx commented Sep 14, 2024 • edited Loading

lmmx commented Sep 14, 2024

lmmx commented Sep 14, 2024 •

edited

Loading