Repository navigation

#

alto-xml

Document Layout Analysis resources repos for development with PdfPig.

C#
623
2 年前

Conversions between various OCR formats

79
2 年前

An OCR evaluation tool

Python
66
3 个月前

Text Overlay plugin for Mirador 3

JavaScript
57
8 天前

ALTO XML schema - latest and all former versions

54
1 年前

Python tools for performing various operations on ALTO XML files

Python
48
6 个月前

Kitodo.Presentation is a feature-rich framework for building a METS- or IIIF-based digital library. It is part of the Kitodo Digital Library Suite.

JavaScript
42
9 小时前

Image Retrieval in Digital Libraries - A Multicollection Experimentation of Machine Learning techniques

XQuery
26
4 个月前

Data Mining Historical Newspaper Metadata (METS/ALTO formats)

HTML
25
3 年前

Convert ALTO XML to plain text + minimal metadata

Python
17
10 个月前

Command Line Interface (CLI) to export METS/ALTO documents to other formats.

Java
13
3 年前

Extract the MODS/ALTO metadata of a bunch of METS/ALTO files into pandas DataFrames for data analysis

Python
12
7 小时前

A pipeline to transfer ground truth from Transkribus to eScriptorium.

Python
9
1 年前

Helper functions and web app for METS/ALTO archive viewing.

JavaScript
6
2 年前

Extracting illustrations from ALTO documents with IIIF

Perl
5
10 年前

a bunch of scripts to manipulate ALTO and XML/TEI

XSLT
5
3 个月前

Create a searchable PDF with ALTO-XML and JP2 files.

CSS
4
5 年前

Data for layout analysis and HTR.

Python
4
4 年前