FTIIBench

(ARXIV24) This is the official code repository for "FTII-Bench: A Comprehensive Multimodal Benchmark for Flow Text with Image Insertion."

Dataset

The text of FTII-Bench could be download from Google Drive

The images of FTII-Bench could be download from Google Drive

Set the appropriate paths in the run_eval_fi and run_eval_sc scripts.

bash run_eval_fi.sh # for flow insertion tasks
bash run_eval_sc.sh # for single choice tasks

For evaluating with BGE models You can run ./mllm_eval/bge_eval.ipynb in the Jupyter environment.

Thanks to the open-source code from Mantis