Repository navigation

#

document-data-extraction

TWIX is an open-source data extraction tool that reconstructs structured data from documents at scale, accurately and at low cost, by inferring the shared underlying visual template across documents

Python
201
3 个月前