Learning to Decipher from Pixels—A Case Study of Copiale

dc.contributor.authorKang, Lei
dc.contributor.authorGregorio, Giuseppe De
dc.contributor.authorHeil, Raphaela
dc.contributor.authorFornés, Alicia
dc.contributor.authorMegyesi, Beáta
dc.contributor.editorDesenclos, Camille
dc.contributor.editorPierrot, Cécile
dc.date.accessioned2026-06-15T10:43:12Z
dc.date.available2026-06-15T10:43:12Z
dc.date.issued2026-06-22
dc.description.abstractHistorical encrypted manuscripts require both paleographic interpretation of cipher symbols and cryptanalytic recovery of plaintext. Most existing computational workflows rely on a transcription-first paradigm, in which handwritten symbols are transcribed prior to decipherment. This intermediate step is labor-intensive, error-prone, and not always aligned with the goal of direct plaintext recovery. We propose an end-to-end, transcription-free approach that directly maps handwritten cipher images to plaintext. Using the Copiale cipher as a case study, we introduce the first text-line-level dataset pairing cipher images with German plaintext. We show that pretraining on generic handwriting data followed by cipher-specific fine-tuning substantially improves decipherment accuracy. Our results demonstrate that transcription-free image-to- plaintext decipherment is both feasible and effective for historical substitution ciphers, offering a simplified and scalable alternative to traditional pipelines.
dc.identifier.issn1736- 6305
dc.identifier.urihttps://hdl.handle.net/10062/122085
dc.language.isoen
dc.publisherTartu University Library
dc.relation.ispartofseriesNEALT Proceedings Series Number 61
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectCopiale Cipher
dc.subjectTranscription-Free Decipherment
dc.subjectHandwriting Pre-training
dc.subjectTransformer Models
dc.titleLearning to Decipher from Pixels—A Case Study of Copiale
dc.typeArticle

Failid

Originaal pakett

Nüüd näidatakse 1 - 1 1
Laen...
Pisipilt
Nimi:
HistoCrypt_2026_paper_30.pdf
Suurus:
425.31 KB
Formaat:
Adobe Portable Document Format