Solving a 750-Letter General Bigram Substitution Challenge
| dc.contributor.author | Schmeh, Klaus | |
| dc.contributor.author | Dunin, Elonka | |
| dc.contributor.author | Van Eycke, Jarl | |
| dc.contributor.author | Helm, Louie | |
| dc.contributor.editor | Antal, Eugen | |
| dc.contributor.editor | Marák, Pavol | |
| dc.date.accessioned | 2025-05-16T13:29:27Z | |
| dc.date.available | 2025-05-16T13:29:27Z | |
| dc.date.issued | 2025 | |
| dc.description.abstract | The general bigram substitution cipher is an encryption method originating in the Renaissance. It operates using a substitution table that maps each possible letter pair (bigram) to a unique replacement. While conceptually straightforward, this cipher is notably challenging to break, particularly when dealing with short ciphertexts. To inspire further research, one of the authors initiated a bigram substitution challenge featuring a 750-character ciphertext. In this paper, we present the solution to that challenge, achieved by two other authors using a hill climbing algorithm combined with a scoring function based on 8-gram (eight-letter sequence) frequencies. Since no prior 8-gram frequency statistics existed for the English language, one of the authors developed a comprehensive dataset by analyzing 2 terabytes of text, including 5.8 million books and the entire content of Wikipedia. This achievement, to our knowledge, marks the shortest bigram substitution ciphertext ever successfully decrypted. Furthermore, we propose a new challenge based on a 600-character ciphertext and invite readers to tackle it, setting the stage for future advancements in this field. | |
| dc.identifier.issn | 1736-6305 | |
| dc.identifier.uri | https://hdl.handle.net/10062/109754 | |
| dc.language.iso | en | |
| dc.publisher | Tartu University Library | |
| dc.relation.ispartofseries | NEALT Proceedings Series 58 | |
| dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
| dc.subject | bigram substitution | |
| dc.subject | digraph substitution | |
| dc.subject | Playfair | |
| dc.subject | hill climbing | |
| dc.subject | simulated annealing | |
| dc.subject | Giovanni Battista Porta | |
| dc.title | Solving a 750-Letter General Bigram Substitution Challenge | |
| dc.type | Article |
Failid
Originaal pakett
1 - 1 1