Establishing a Document Layout Analysis Baseline for Historical Cipher Keys
| dc.contributor.author | Heil, Raphaela | |
| dc.contributor.author | Fornés, Alicia | |
| dc.contributor.author | Láng, Benedek | |
| dc.contributor.author | Megyesi, Beáta | |
| dc.contributor.editor | Desenclos, Camille | |
| dc.contributor.editor | Pierrot, Cécile | |
| dc.date.accessioned | 2026-06-15T11:09:47Z | |
| dc.date.available | 2026-06-15T11:09:47Z | |
| dc.date.issued | 2026-06-22 | |
| dc.description.abstract | Historical cipher keys encode mappings between plaintext elements and cipher symbols and are characterized by complex, heterogeneous handwritten layouts. This paper establishes a baseline for document layout analysis (DLA) of historical cipher keys using a newly annotated dataset of 350 images from European archives dating from ca. 1300 to 1850 CE. We evaluate four YOLO-based architectures under three conditions: training from scratch, cross-domain transfer from models pre-trained on DocLayNet and CATMuS in a class-agnostic setting, and fine-tuning of these pre-trained models on cipher key data. Results show that training from scratch is limited by data scarcity and unstable convergence, while direct transfer across DLA domains performs poorly. In contrast, fine-tuning consistently improves performance across all architectures, demonstrating the feasibility of adapting existing DLA models to cipher keys and supporting downstream tasks such as key extraction and comparative cryptographic analysis. | |
| dc.identifier.issn | 1736-6305 | |
| dc.identifier.uri | https://hdl.handle.net/10062/122094 | |
| dc.language.iso | en | |
| dc.publisher | Tartu University Library | |
| dc.relation.ispartofseries | NEALT Proceedings Series Number 61 | |
| dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
| dc.subject | cipher keys | |
| dc.subject | document layout analysis | |
| dc.subject | historical cryptology | |
| dc.title | Establishing a Document Layout Analysis Baseline for Historical Cipher Keys | |
| dc.type | Article |
Files
Original bundle
- Name: HistoCrypt_2026_paper_35.pdf
- Size: 23.93 MB
- Format: Adobe Portable Document Format