Misclassified tables and/or figures may be tossed incorrectly #1206
Labels
bug
From Hemiptera and especially its suborder Heteroptera
implemented
The issue has been implemented
A few cases of text disappearing from the fulltext have been reported to me.
I've identified two issues related to figures and tables.
First case: paragraphs are misclassified as tables by the fulltext model. Subsequently, the table model classifies all the text as <content>, and the incomplete table is then tossed away.
I wonder whether it would be possible to detect false-positive tables from the related classes and convert them to <paragraph>.
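To make the idea concrete, here is a rough sketch of the check I have in mind, written in Python only for brevity. It is not the actual implementation: the function names, the `threshold` parameter, and the `<table>` and `<figure_head>` label strings are assumptions for illustration. Only `<content>` and `<paragraph>` come from the case described above.

```python
from collections import Counter
from typing import Sequence

CONTENT = "<content>"  # label the table model assigned to all the text in the reported case


def content_ratio(token_labels: Sequence[str]) -> float:
    """Fraction of tokens the table model labelled as <content>."""
    if not token_labels:
        return 0.0
    return Counter(token_labels)[CONTENT] / len(token_labels)


def relabel_block(fulltext_label: str,
                  token_labels: Sequence[str],
                  threshold: float = 1.0) -> str:
    """Return <paragraph> for a block the fulltext model called a table
    when the table model found (almost) nothing but <content> in it.

    "<table>" and the threshold value are hypothetical, chosen for this sketch.
    """
    if fulltext_label == "<table>" and content_ratio(token_labels) >= threshold:
        return "<paragraph>"
    return fulltext_label


# A block with only <content> tokens is treated as a false-positive table.
print(relabel_block("<table>", ["<content>"] * 5))
# A block that also has a header-like label is left as a table.
print(relabel_block("<table>", ["<figure_head>", "<content>", "<content>"]))
```

With the default threshold of 1.0 a block is only converted when every token is `<content>`; a lower value would also catch blocks with a few stray labels, at the risk of converting real tables that lack a caption.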
PDF: pub.1158465915.pdf