Repository navigation

#

pdf2markdown

Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

Python
52779
1 天前

A high-quality PDF to Markdown tool based on large language model visual recognition. 一款基于大模型视觉识别的高质量PDF转Markdown工具

Python
1556
23 天前

Validate and annotate image captions easily with AnnotaFlow. This Python tool features a user-friendly GUI, efficient progress tracking, and clean MVC architecture. 🐙📁

Python
0
1 个月前