You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi @tylermaran , the image above is part of a PDF page. When I use zerox, I hope zerox can recognize it as a picture and extract the picture and put it in a separate directory, but zerox can't do it at present.
Hey @Wyzanezan. Can you share an example of the type of page you're thinking of?
Right now we're not isolating images out of documents, but it's been something of interest to people. Today we'd just get [image description](image)
Is there a specific modification of the core prompt that would be required to generate a caption. For example, the first hope would be to generate a rich description of the content and a second would be a sufficient prompt to re-generate if necessary.
As an example, a similar project takes this input of a bar chart and produces this brief markdown. At this point, there's no comments on the accuracy of the image but if it's going through some of the big API-based GPTs (or maybe even llava or qwen?) they should do a decent recognition and description.
Oh, and big thanks for sharing your work and providing a great starter that others can enhance for this functionality!
No description provided.
The text was updated successfully, but these errors were encountered: