PP-OCRv5 Collection PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese • 13 items • Updated Sep 15, 2025 • 52
DocLLM: A layout-aware generative language model for multimodal document understanding Paper • 2401.00908 • Published Dec 31, 2023 • 189
LMDX: Language Model-based Document Information Extraction and Localization Paper • 2309.10952 • Published Sep 19, 2023 • 66