Implement first version of parsing evaluation #3544

chloedia · 2025-01-02T15:40:03Z

For a list of potential datasets for parsing see CORE-335.

For details on OmniDocBench, see CORE-332 or Notion.

Evaluation steps for CI/CD

We only use a single EN subset, from which masked documents have been excluded. Each subset contains 57 single-page documents. We will run the evaluation on both native and image pdfs.

Load dataset --> CORE-355
1. For each row in the subset, retrieve the native (original) pdf from the url https://huggingface.co/datasets/Quivr/OmniDocBench/blob/main/ori_pdfs/file_name where file_name is extracted from page_info.image_path
2. For each row in the subset, retrieve the image pdf from the url https://huggingface.co/datasets/Quivr/OmniDocBench/blob/main/pdfs/file_name
Run Megaparse on each document, on both the native-pdf and image-pdf versions, and store the results in a JSON file --> CORE-342
Compute the parsing metrics --> CORE-331
Compute the OCR metrics --> CORE-333
Push the different results (the Megaparse output, plus the outputs of the parsing-metrics and OCR-metrics steps) as JSON files to the exp. tracker, along with --> CORE-343
Alert if metrics are below a given threshold --> CORE-344
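
The loading and alerting steps above can be sketched roughly as follows. This is a minimal sketch under assumptions, not the actual implementation: the helper names `pdf_urls` and `metrics_below_threshold`, the metric names, and the threshold values are all hypothetical. It also assumes that `file_name` is simply the last path component of `page_info.image_path` and that higher metric values are better. Note that `/blob/` URLs on the Hugging Face Hub point to a web page; raw file downloads usually go through `/resolve/` instead, so the URL may need adjusting before fetching.

```python
from __future__ import annotations

from pathlib import PurePosixPath

# Base URL as written in the issue (assumption: kept verbatim, including "/blob/").
BASE_URL = "https://huggingface.co/datasets/Quivr/OmniDocBench/blob/main"


def pdf_urls(image_path: str) -> dict[str, str]:
    """Build the native and image pdf URLs for one dataset row.

    Assumption: file_name is the final component of page_info.image_path,
    used as-is (the issue does not say whether the extension changes).
    """
    file_name = PurePosixPath(image_path).name
    return {
        "native": f"{BASE_URL}/ori_pdfs/{file_name}",
        "image": f"{BASE_URL}/pdfs/{file_name}",
    }


def metrics_below_threshold(
    metrics: dict[str, float], thresholds: dict[str, float]
) -> dict[str, float]:
    """Return the metrics whose value is strictly below its threshold.

    Assumption: higher is better for every metric; metrics without a
    configured threshold are ignored.
    """
    return {
        name: value
        for name, value in metrics.items()
        if name in thresholds and value < thresholds[name]
    }
```

A CI job could call `metrics_below_threshold` on the computed metrics and raise an alert whenever the returned dictionary is non-empty.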

Evaluation steps for optimising Megaparse

We should also be able to manually run these evaluations using the full dataset, i.e. subsets 1 to 5, for the purpose of optimizing / improving our parsing service.
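
One possible way to support both modes is a small command-line entry point whose subset selection defaults to the full dataset and can be narrowed for CI. This is only a sketch: the `--subsets` flag and the `parse_args` helper are hypothetical, and it assumes the subsets are identified by the integers 1 to 5.

```python
import argparse

# Assumption: subsets are identified by the integers 1 to 5 ("subsets 1 to 5").
ALL_SUBSETS = [1, 2, 3, 4, 5]


def parse_args(argv=None):
    """Parse the (hypothetical) command-line options of the evaluation script."""
    parser = argparse.ArgumentParser(
        description="Run the parsing evaluation (sketch)."
    )
    parser.add_argument(
        "--subsets",
        type=int,
        nargs="+",
        choices=ALL_SUBSETS,
        default=ALL_SUBSETS,
        help="Subsets to evaluate; defaults to the full dataset.",
    )
    return parser.parse_args(argv)
```

With this shape, a manual optimisation run would use the default, while the CI job would pass a single subset explicitly, for example `--subsets 1`.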


linear · 2025-01-02T15:40:04Z

CORE-326 Implement a first draft of metrics

StanGirard assigned chloedia Jan 20, 2025

jacopo-chevallard assigned jacopo-chevallard and unassigned chloedia Jan 21, 2025

jacopo-chevallard changed the title ~~Implement a first draft of metrics~~ Implement a first draft of parsing metrics Jan 22, 2025

jacopo-chevallard changed the title ~~Implement a first draft of parsing metrics~~ Implement first version of parsing metrics Jan 22, 2025

jacopo-chevallard changed the title ~~Implement first version of parsing metrics~~ Implement first version of parsing evaluation Jan 28, 2025
