Panic at FirstChar #36
Comments
Can you attach or link to an example PDF?
Attached are two PDFs: test.pdf was generated by Word, and version1_3.pdf is the same PDF converted to version 1.3. Reading these two PDFs with this Rust library does not produce the error that the documents I tested earlier did. There may be something specific to those documents, which I cannot share because they contain sensitive information. The issue does not seem to be strictly tied to the PDF version (although version 1.2 had some missing data when extracting), but that may be a fluke or a separate issue.
Can you get a stack from a debug build?
Here is the error that it runs into with RUST_BACKTRACE=1 set.
I created a fork of the project that includes the custom changes that I have been using if you want to look at the exact version of
I am using the most recent version of this crate to extract text from old PDF documents. With PDF documents at version 1.3, it consistently panics with the following error:
Not sure if the issue is actually due to the PDF version, but it seems to be a consistent factor across the PDFs that are causing this panic. For anything version 1.4 or newer it seems to have fewer or different issues.
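One way to check whether the version really correlates with the panic is to read the `%PDF-M.N` header of each file and group the failing documents by it. The following is only a minimal sketch using the standard library; `pdf_version` is a hypothetical helper, not part of this crate, and it assumes single-digit major and minor numbers. Note that from PDF 1.4 onward the document catalog may carry a `Version` entry that overrides the header, which this sketch does not inspect.

```rust
/// Hypothetical helper (not part of the crate discussed here):
/// find the `%PDF-M.N` header and return `(major, minor)` if present.
fn pdf_version(bytes: &[u8]) -> Option<(u8, u8)> {
    // Only scan the first 1024 bytes, since some files carry a little
    // leading junk before the header.
    let window = &bytes[..bytes.len().min(1024)];
    let marker = b"%PDF-";

    // Locate the marker and skip past it.
    let start = window
        .windows(marker.len())
        .position(|w| w == marker)?
        + marker.len();

    // Assumption: single-digit major, a dot, single-digit minor, e.g. "1.3".
    match &window[start..] {
        [major, b'.', minor, ..] if major.is_ascii_digit() && minor.is_ascii_digit() => {
            Some((*major - b'0', *minor - b'0'))
        }
        _ => None,
    }
}
```

Calling it on the result of `std::fs::read(path)` for each document gives a quick table of versions to compare against which files panic.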
Here is the backtrace as well: