alto-xml

Document layout analysis resources for development with PdfPig.

C#
611
2 years ago

Conversions between various OCR formats

75
2 years ago

An OCR evaluation tool

Python
65
3 days ago

Text Overlay plugin for Mirador 3

JavaScript
54
1 month ago

ALTO XML schema - latest and all former versions

52
9 months ago

Python tools for performing various operations on ALTO XML files

Python
46
2 months ago

Kitodo.Presentation is a feature-rich framework for building a METS- or IIIF-based digital library. It is part of the Kitodo Digital Library Suite.

JavaScript
39
17 days ago

Image Retrieval in Digital Libraries - A Multicollection Experimentation of Machine Learning techniques

XQuery
26
13 days ago

Data Mining Historical Newspaper Metadata (METS/ALTO formats)

HTML
25
3 years ago

Convert ALTO XML to plain text + minimal metadata

Python
16
6 months ago

Command Line Interface (CLI) to export METS/ALTO documents to other formats.

Java
13
3 years ago

Extract the MODS/ALTO metadata of a bunch of METS/ALTO files into pandas DataFrames for data analysis

Python
11
5 months ago

A pipeline to transfer ground truth from Transkribus to eScriptorium.

Python
9
1 year ago

Helper functions and web app for METS/ALTO archive viewing.

JavaScript
6
2 years ago

Extracting illustrations from ALTO documents with IIIF

Perl
5
9 years ago

A bunch of scripts to manipulate ALTO and XML/TEI

XSLT
5
4 years ago

Create a searchable PDF with ALTO-XML and JP2 files.

CSS
4
4 years ago

Data for layout analysis and HTR.

Python
4
4 years ago