From ccf1583659a19d940ca65b4c026e92d241584251 Mon Sep 17 00:00:00 2001 From: felix Date: Thu, 29 Jun 2023 11:52:45 +0200 Subject: [PATCH] update --- docs/source/using_doctr/using_models.rst | 56 +++++++++++++----------- 1 file changed, 31 insertions(+), 25 deletions(-) diff --git a/docs/source/using_doctr/using_models.rst b/docs/source/using_doctr/using_models.rst index 2a8ded7e6..23e8390f0 100644 --- a/docs/source/using_doctr/using_models.rst +++ b/docs/source/using_doctr/using_models.rst @@ -36,31 +36,37 @@ We also provide 2 models working with any kind of rotated documents: For a comprehensive comparison, we have compiled a detailed benchmark on publicly available datasets: -+-----------------------------------------------------------------------------------+----------------------------+----------------------------+---------+ -| | FUNSD | CORD | | -+================+=================================+=================+==============+============+===============+============+===============+=========+ -| **Backend** | **Architecture** | **Input shape** | **# params** | **Recall** | **Precision** | **Recall** | **Precision** | **FPS** | -+----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+---------+ -| TensorFlow | db_resnet50 | (1024, 1024, 3) | 25.2 M | 82.14 | 87.64 | 92.49 | 89.66 | 2.1 | -+----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+---------+ -| Tensorflow | db_mobilenet_v3_large | (1024, 1024, 3) | 4.2 M | 79.35 | 84.03 | 81.14 | 66.85 | | -+----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+---------+ -| TensorFlow | linknet_resnet18 | (1024, 1024, 3) | 25.2 M | 82.14 | 87.64 | 92.49 | 89.66 | 2.1 | -+----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+---------+ -| Tensorflow | linknet_resnet18_rotation | (1024, 1024, 3) | 4.2 M | 79.35 | 84.03 | 81.14 | 66.85 | | -+----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+---------+ -| TensorFlow | linknet_resnet34 | (1024, 1024, 3) | 25.2 M | 82.14 | 87.64 | 92.49 | 89.66 | 2.1 | -+----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+---------+ -| Tensorflow | linknet_resnet50 | (1024, 1024, 3) | 4.2 M | 79.35 | 84.03 | 81.14 | 66.85 | | -+----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+---------+ -| PyTorch | db_resnet34 | (1024, 1024, 3) | 25.2 M | 82.14 | 87.64 | 92.49 | 89.66 | 2.1 | -+----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+---------+ -| PyTorch | db_resnet50 | (1024, 1024, 3) | 4.2 M | 79.35 | 84.03 | 81.14 | 66.85 | | -+----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+---------+ -| PyTorch | db_resnet50_rotation | (1024, 1024, 3) | 25.2 M | 82.14 | 87.64 | 92.49 | 89.66 | 2.1 | -+----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+---------+ -| PyTorch | db_mobilenet_v3_large | (1024, 1024, 3) | 4.2 M | 79.35 | 84.03 | 81.14 | 66.85 | | -+----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+---------+ ++-----------------------------------------------------------------------------------+----------------------------+----------------------------+-------------+ +| | FUNSD | CORD | | ++================+=================================+=================+==============+============+===============+============+===============+=============+ +| **Backend** | **Architecture** | **Input shape** | **# params** | **Recall** | **Precision** | **Recall** | **Precision** | **sec/it** | ++----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+-------------+ +| TensorFlow | db_resnet50 | (1024, 1024, 3) | 25.2 M | 81.22 | 86.66 | 92.46 | 89.62 | 1.2 | ++----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+-------------+ +| Tensorflow | db_mobilenet_v3_large | (1024, 1024, 3) | 4.2 M | 78.27 | 82.77 | 80.99 | 66.57 | 0.5 | ++----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+-------------+ +| TensorFlow | linknet_resnet18 | (1024, 1024, 3) | 11.5 M | 78.23 | 83.77 | 82.88 | 82.42 | 0.7 | ++----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+-------------+ +| Tensorflow | linknet_resnet18_rotation | (1024, 1024, 3) | 11.5 M | 81.12 | 82.13 | 83.55 | 80.14 | 0.6 | ++----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+-------------+ +| TensorFlow | linknet_resnet34 | (1024, 1024, 3) | 21.6 M | 82.14 | 87.64 | 85.55 | 86.02 | 0.8 | ++----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+-------------+ +| Tensorflow | linknet_resnet50 | (1024, 1024, 3) | 28.8 M | 79.00 | 84.79 | 85.89 | 65.75 | 1.1 | ++----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+-------------+ +| PyTorch | db_resnet34 | (1024, 1024, 3) | 22.4 M | | | | | 0.8 | ++----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+-------------+ +| PyTorch | db_resnet50 | (1024, 1024, 3) | 25.4 M | 79.17 | 86.31 | 92.96 | 91.23 | 1.1 | ++----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+-------------+ +| PyTorch | db_resnet50_rotation | (1024, 1024, 3) | 25.4 M | 83.30 | 91.07 | 91.63 | 90.53 | 1.6 | ++----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+-------------+ +| PyTorch | db_mobilenet_v3_large | (1024, 1024, 3) | 4.2 M | 80.06 | 84.12 | 80.51 | 66.51 | 0.5 | ++----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+-------------+ +| PyTorch | linknet_resnet18 | (1024, 1024, 3) | 11.5 M | | | | | | ++----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+-------------+ +| PyTorch | linknet_resnet34 | (1024, 1024, 3) | 21.6 M | | | | | | ++----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+-------------+ +| PyTorch | linknet_resnet50 | (1024, 1024, 3) | 28.8 M | | | | | | ++----------------+---------------------------------+-----------------+--------------+------------+---------------+------------+---------------+-------------+ All text detection models above have been evaluated using both the training and evaluation sets of FUNSD and CORD (cf. :ref:`datasets`).