marker_single bbox detection crash on non-simple PDFs #127
Comments
Try again after updating the package; I fixed a memory leak after you posted this.
TY @VikParuchuri! This time file1 (arXiv 2401.14295v1) made it through the first bbox detection loop (5/5 successful), but it crashed on the second bbox detection loop (0/4). I'm using marker-pdf v0.2.6. There are also some dependency errors that need to be sorted out: langchain-core 0.1.48 requires packaging<24.0,>=23.2, but 24.0 is installed.
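To see why that dependency report is a conflict, here is a minimal sketch of the range check, assuming plain dotted numeric versions. The helpers `parse_version` and `satisfies` are hypothetical names written for illustration; they are not part of pip, packaging, or marker, and they do not implement full PEP 440 parsing.

```python
def parse_version(text):
    """Turn a plain dotted version such as '23.2' into a tuple like (23, 2).

    Assumption: only numeric components; pre-release tags are not handled.
    """
    return tuple(int(part) for part in text.split("."))


def satisfies(installed, lower_inclusive, upper_exclusive):
    """Return True if lower_inclusive <= installed < upper_exclusive."""
    version = parse_version(installed)
    return parse_version(lower_inclusive) <= version < parse_version(upper_exclusive)


# The constraint from the report: packaging<24.0,>=23.2 with 24.0 installed.
conflict_free = satisfies("24.0", "23.2", "24.0")
print(conflict_free)
```

The upper bound is exclusive, so an installed 24.0 falls just outside the allowed range, which is what the installer is complaining about.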
If you can share the files, it would help me debug. Langchain and mkdocs aren't marker dependencies, so installing marker in a virtualenv might help with isolating other dependencies.
about:
history:
$marker_single <file1.pdf>
$marker_single <file2.pdf>
$marker_single <file3.pdf>
file1 is arXiv 2401.14295v1 (Topologies of Reasoning).
file2 is a chapter from a book on game theory. Lots of images.
file3 is a simple HTML-to-pdf glossary doc. No images, just a list of terms & definitions.