You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I discover that calling pdf.Page.GetTextByRow and got result with disordered text with some pdf file. For example, I got "761" which should be "176".
I found the result is that in page.go sort with sort.Sort which is not stable, and replace the sort function with sort.Stable can solve the problem.
And pdf.Page.GetTextByColumn also need to modify the same.
The text was updated successfully, but these errors were encountered:
I tried replacing sort.Sort with sort.Stable. It did not help this problem. Text is not in the same order as with r.GetPlainText(). GetPlainText seems to produce text in the correct order, but without linefeeds, it makes the text hard to read.
I discover that calling pdf.Page.GetTextByRow and got result with disordered text with some pdf file. For example, I got "761" which should be "176".
I found the result is that in page.go sort with sort.Sort which is not stable, and replace the sort function with sort.Stable can solve the problem.
And pdf.Page.GetTextByColumn also need to modify the same.
The text was updated successfully, but these errors were encountered: