From Statistics to Neural Networks: Enhancing Ciphertext-Plaintext Alignment in Historical Substitution Ciphers for Automatic Key Extraction

dc.contributor.authorBruton, Micaella
dc.contributor.authorMegyesi, Beáta
dc.contributor.editorAntal, Eugen
dc.contributor.editorMarák, Pavol
dc.date.accessioned2025-05-16T12:50:40Z
dc.date.available2025-05-16T12:50:40Z
dc.date.issued2025
dc.description.abstractCiphertext manuscripts found in archival collections are often intermingled with plaintext manuscripts in various languages, making the manual analysis required to match the documents labour-intensive and complex. Automating the alignment of these texts to reconstruct corresponding cipher keys is therefore highly beneficial, particularly when handling large volumes of documents. This study introduces a novel approach using modern neural networks, specifically Long Short-Term Memory (LSTM) architectures, to develop an automated method for aligning homophonic substitution ciphertexts with plaintext. These neural models are compared to traditional statistical approaches, demonstrating that LSTMs achieve significant accuracy improvements, including perfect alignment for ciphertexts of 50 characters or less. Additionally, to facilitate practical application, a program has been developed to enable the upload of transcribed ciphertext and plaintext documents, using the optimized models to automatically align the texts and extract the substitution key.
dc.identifier.issn1736-6305
dc.identifier.urihttps://hdl.handle.net/10062/109741
dc.language.isoen
dc.publisherTartu University Library
dc.relation.ispartofseriesNEALT Proceedings Series 58
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectCiphertext alignment
dc.subjectPlaintext alignment
dc.subjectHistorical cryptanalysis
dc.subjectNeural cryptanalysis
dc.subjectHomophonic substitution ciphers
dc.subjectLong Short-Term Memory (LSTM)
dc.subjectText alignment
dc.subjectAutomated key extraction
dc.subjectHistorical manuscripts
dc.subjectComputational cryptanalysis
dc.subjectSequence-to-sequence models
dc.titleFrom Statistics to Neural Networks: Enhancing Ciphertext-Plaintext Alignment in Historical Substitution Ciphers for Automatic Key Extraction
dc.typeArticle

Failid

Originaal pakett

Nüüd näidatakse 1 - 1 1
Laen...
Pisipilt
Nimi:
3.pdf
Suurus:
308.38 KB
Formaat:
Adobe Portable Document Format