readingDirection and textLineOrder #2
Comments
It is also (trivially) implemented: |
Oh, right! So you can already set the check in the Readme, no? Just curious: why |
Copy-Pasta, the reading order conversion should indeed be recursive.
My idea was that |
All regions can embed all other region types. As for typical cases, I don't really know. I guess that besides the pattern table→text, which is mandatory, the obvious text→text should be pretty pervasive due to cases like block→(heading|paragraph) or block→(drop-capital|paragraph) or block→(list-label|paragraph). Then there's of course image/graphics→text due to the caption relation. But one could think of many combinations, depending on the complexity of the layout and the necessity of representation... I think we should try to be as general and agnostic as possible. |
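To illustrate what a general, type-agnostic recursion over nested regions could look like, here is a minimal Python sketch. It is not this project's code: `walk_regions`, `local_name`, and the sample document are hypothetical, and the sketch only follows XML document order (it does not evaluate `ReadingOrder`). It assumes that every element whose local name ends in `Region` counts as a region and may contain any other region type.

```python
import xml.etree.ElementTree as ET

# Hypothetical, heavily reduced PAGE-like sample covering the nesting patterns
# mentioned above: table->text, text->text, image->text (caption).
SAMPLE = """<PcGts xmlns="http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15">
  <Page>
    <TableRegion id="t1">
      <TextRegion id="t1_c1"/>
      <TextRegion id="t1_c2"/>
    </TableRegion>
    <TextRegion id="b1">
      <TextRegion id="b1_heading"/>
      <TextRegion id="b1_para"/>
    </TextRegion>
    <ImageRegion id="i1">
      <TextRegion id="i1_caption"/>
    </ImageRegion>
  </Page>
</PcGts>"""


def local_name(element):
    """Return the tag name without its '{namespace}' prefix."""
    return element.tag.rsplit("}", 1)[-1]


def walk_regions(parent, depth=0):
    """Yield (depth, region type, id) for all nested regions, depth-first.

    Assumption: any child whose local name ends in 'Region' is a region,
    regardless of the parent's type. Non-region children are skipped.
    """
    for child in parent:
        if local_name(child).endswith("Region"):
            yield depth, local_name(child), child.get("id")
            yield from walk_regions(child, depth + 1)


root = ET.fromstring(SAMPLE)
page = next(el for el in root if local_name(el) == "Page")
regions = list(walk_regions(page))
for depth, kind, region_id in regions:
    print("  " * depth + f"{kind} {region_id}")
```

Because the recursion never checks which region type contains which, any combination is handled the same way. A real conversion would presumably consult `ReadingOrder` first and fall back to document order only where no order is given.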
On the ALTO-side, one may express regions-in-regions as |
BTW, on the line level, besides |
Perhaps we should simply make this configurable as in #27 for regions: |
Not sure if there's any equivalent for that in ALTO. Glyphs are supposed to be ordered by XML order. Spec does not say anything about words and lines though.
IDNEXT is only for region-level ReadingOrder, I guess (which you already have in the TODO). But see PRImA-Research-Lab/PAGE-XML#26 for correct interpretation on the PAGE side.