Learning to Decipher from Pixels—A Case Study of Copiale
| dc.contributor.author | Kang, Lei | |
| dc.contributor.author | Gregorio, Giuseppe De | |
| dc.contributor.author | Heil, Raphaela | |
| dc.contributor.author | Fornés, Alicia | |
| dc.contributor.author | Megyesi, Beáta | |
| dc.contributor.editor | Desenclos, Camille | |
| dc.contributor.editor | Pierrot, Cécile | |
| dc.date.accessioned | 2026-06-15T10:43:12Z | |
| dc.date.available | 2026-06-15T10:43:12Z | |
| dc.date.issued | 2026-06-22 | |
| dc.description.abstract | Historical encrypted manuscripts require both paleographic interpretation of cipher symbols and cryptanalytic recovery of plaintext. Most existing computational workflows rely on a transcription-first paradigm, in which handwritten symbols are transcribed prior to decipherment. This intermediate step is labor-intensive, error-prone, and not always aligned with the goal of direct plaintext recovery. We propose an end-to-end, transcription-free approach that directly maps handwritten cipher images to plaintext. Using the Copiale cipher as a case study, we introduce the first text-line-level dataset pairing cipher images with German plaintext. We show that pretraining on generic handwriting data followed by cipher-specific fine-tuning substantially improves decipherment accuracy. Our results demonstrate that transcription-free image-to- plaintext decipherment is both feasible and effective for historical substitution ciphers, offering a simplified and scalable alternative to traditional pipelines. | |
| dc.identifier.issn | 1736- 6305 | |
| dc.identifier.uri | https://hdl.handle.net/10062/122085 | |
| dc.language.iso | en | |
| dc.publisher | Tartu University Library | |
| dc.relation.ispartofseries | NEALT Proceedings Series Number 61 | |
| dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
| dc.subject | Copiale Cipher | |
| dc.subject | Transcription-Free Decipherment | |
| dc.subject | Handwriting Pre-training | |
| dc.subject | Transformer Models | |
| dc.title | Learning to Decipher from Pixels—A Case Study of Copiale | |
| dc.type | Article |
Failid
Originaal pakett
1 - 1 1
Laen...
- Nimi:
- HistoCrypt_2026_paper_30.pdf
- Suurus:
- 425.31 KB
- Formaat:
- Adobe Portable Document Format