Solving a 750-Letter General Bigram Substitution Challenge

dc.contributor.authorSchmeh, Klaus
dc.contributor.authorDunin, Elonka
dc.contributor.authorVan Eycke, Jarl
dc.contributor.authorHelm, Louie
dc.contributor.editorAntal, Eugen
dc.contributor.editorMarák, Pavol
dc.date.accessioned2025-05-16T13:29:27Z
dc.date.available2025-05-16T13:29:27Z
dc.date.issued2025
dc.description.abstractThe general bigram substitution cipher is an encryption method originating in the Renaissance. It operates using a substitution table that maps each possible letter pair (bigram) to a unique replacement. While conceptually straightforward, this cipher is notably challenging to break, particularly when dealing with short ciphertexts. To inspire further research, one of the authors initiated a bigram substitution challenge featuring a 750-character ciphertext. In this paper, we present the solution to that challenge, achieved by two other authors using a hill climbing algorithm combined with a scoring function based on 8-gram (eight-letter sequence) frequencies. Since no prior 8-gram frequency statistics existed for the English language, one of the authors developed a comprehensive dataset by analyzing 2 terabytes of text, including 5.8 million books and the entire content of Wikipedia. This achievement, to our knowledge, marks the shortest bigram substitution ciphertext ever successfully decrypted. Furthermore, we propose a new challenge based on a 600-character ciphertext and invite readers to tackle it, setting the stage for future advancements in this field.
dc.identifier.issn1736-6305
dc.identifier.urihttps://hdl.handle.net/10062/109754
dc.language.isoen
dc.publisherTartu University Library
dc.relation.ispartofseriesNEALT Proceedings Series 58
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectbigram substitution
dc.subjectdigraph substitution
dc.subjectPlayfair
dc.subjecthill climbing
dc.subjectsimulated annealing
dc.subjectGiovanni Battista Porta
dc.titleSolving a 750-Letter General Bigram Substitution Challenge
dc.typeArticle

Failid

Originaal pakett

Nüüd näidatakse 1 - 1 1
Laen...
Pisipilt
Nimi:
16.pdf
Suurus:
494.52 KB
Formaat:
Adobe Portable Document Format