ignaciaginting's picture
Upload 396 files
230c9a6 verified
==================================
Quick Start
==================================
Once the PDF-Extract-Kit environment is set up and the models are downloaded, we can start using PDF-Extract-Kit.
Layout Detection Example
==============
Layout detection offers several models: ``LayoutLMv3``, ``YOLOv10``, and ``DocLayout-YOLO``. Compared to ``LayoutLMv3``, ``YOLOv10`` is faster. ``DocLayout-YOLO`` is based on YOLOv10 and includes diverse document pre-training and model optimization, offering both speed and high accuracy.
**1. Using Layout Detection Models**
.. code-block:: console
$ python scripts/layout_detection.py --config configs/layout_detection.yaml
After execution, we can view the detection results in the `outputs/layout_detection` directory.
.. note::
The ``layout_detection.yaml`` file sets the input, output, and model configuration. For a more detailed tutorial on layout detection, see :ref:`Layout Detection Algorithm <algorithm_layout_detection>`.
Formula Detection Example
==============
.. code-block:: console
$ python scripts/formula_detection.py --config configs/formula_detection.yaml
After execution, we can view the detection results in the `outputs/formula_detection` directory.
.. note::
The ``formula_detection.yaml`` file sets the input, output, and model configuration. For a more detailed tutorial on formula detection, see :ref:`Formula Detection Algorithm <algorithm_formula_detection>`.