Establishing a Document Layout Analysis Baseline for Historical Cipher Keys

dc.contributor.author: Heil, Raphaela
dc.contributor.author: Fornés, Alicia
dc.contributor.author: Láng, Benedek
dc.contributor.author: Megyesi, Beáta
dc.contributor.editor: Desenclos, Camille
dc.contributor.editor: Pierrot, Cécile
dc.date.accessioned: 2026-06-15T11:09:47Z
dc.date.available: 2026-06-15T11:09:47Z
dc.date.issued: 2026-06-22
dc.description.abstract: Historical cipher keys encode mappings between plaintext elements and cipher symbols and are characterized by complex, heterogeneous handwritten layouts. This paper establishes a baseline for document layout analysis (DLA) of historical cipher keys using a newly annotated dataset of 350 images from European archives dating from ca. 1300 to 1850 CE. We evaluate four YOLO-based architectures under three conditions: training from scratch, cross-domain transfer from models pre-trained on DocLayNet and CATMuS in a class-agnostic setting, and fine-tuning of these pre-trained models on cipher key data. Results show that training from scratch is limited by data scarcity and unstable convergence, while direct transfer across DLA domains performs poorly. In contrast, fine-tuning consistently improves performance across all architectures, demonstrating the feasibility of adapting existing DLA models to cipher keys and supporting downstream tasks such as key extraction and comparative cryptographic analysis.
dc.identifier.issn: 1736-6305
dc.identifier.uri: https://hdl.handle.net/10062/122094
dc.language.iso: en
dc.publisher: Tartu University Library
dc.relation.ispartofseries: NEALT Proceedings Series Number 61
dc.rights: Attribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/
dc.subject: cipher keys
dc.subject: document layout analysis
dc.subject: historical cryptology
dc.title: Establishing a Document Layout Analysis Baseline for Historical Cipher Keys
dc.type: Article

Files

Original bundle

Name: HistoCrypt_2026_paper_35.pdf
Size: 23.93 MB
Format: Adobe Portable Document Format