Extract PDF content using RapidOCR
Extract text from images
Analyze layout and detect elements in documents
Extract LaTeX from images and PDFs
Convert PaddleOCR models to ONNX format