GLOBALISE Ground Truth for Handwritten Text and Layout Recognition

This dataset contains Ground Truth PageXML files that were used to finetune the GLOBALISE Handwritten Text Recognition, baseline detection and region detection models (see Related Publications). This collection includes a datasheet with comprehensive details about the motivation for creating this dataset, the files it comprises, and their potential uses. Additionally, it contains guidelines for creating text region Ground Truth. The transcription Ground Truth files were created in accordance with the guidelines of the Dutch National Archives.

Publisher & creator
GLOBALISE project
Temporal Coverage
1610/1796
Languages
English, Dutch
Issued
May 2, 2024
Modified
May 2, 2024

Registration

Registered on
July 24, 2024 at 09:04 AM
Last read
6 days ago
Completeness
80%
Missing properties: dcat:distribution