Megyesi, BeátaReinares, AlejandraFornés, AliciaGregorio, Giuseppe deDesenclos, CamillePierrot, Cécile2026-06-152026-06-152026-06-221736- 6305https://hdl.handle.net/10062/122083Historical encrypted manuscripts often use invented or heterogeneous alphabets, making alphabet identification a necessary but traditionally manual first step prior to transcription and decryption. This work explores the use of unsupervised computer vision methods to automate this task without requiring labeled data. We propose a pipeline that segments characters from cipher manuscripts, groups them into clusters of visually similar symbols using unsupervised methods, and compares those clusters against a reference database of known alphabet symbols to identify the most likely underlying writing system. Experiments show that the method can correctly identify the alphabet when a handwritten alphabet is available, but performance degrades when handwritten symbols are compared against printed alphabets, with handwriting style dominating shape similarity. These results highlight the importance of realistic handwritten reference alphabets.enAttribution-NonCommercial-NoDerivatives 4.0 Internationalhttps://creativecommons.org/licenses/by/4.0/Ciphered handwritten documentsImage processingAlphabet identificationExploring the Automatic Alphabet Identification of Images of Handwritten CiphersArticle