Exploring the Automatic Alphabet Identification of Images of Handwritten Ciphers

dc.contributor.authorMegyesi, Beáta
dc.contributor.authorReinares, Alejandra
dc.contributor.authorFornés, Alicia
dc.contributor.authorGregorio, Giuseppe de
dc.contributor.editorDesenclos, Camille
dc.contributor.editorPierrot, Cécile
dc.date.accessioned2026-06-15T10:35:03Z
dc.date.available2026-06-15T10:35:03Z
dc.date.issued2026-06-22
dc.description.abstractHistorical encrypted manuscripts often use invented or heterogeneous alphabets, making alphabet identification a necessary but traditionally manual first step prior to transcription and decryption. This work explores the use of unsupervised computer vision methods to automate this task without requiring labeled data. We propose a pipeline that segments characters from cipher manuscripts, groups them into clusters of visually similar symbols using unsupervised methods, and compares those clusters against a reference database of known alphabet symbols to identify the most likely underlying writing system. Experiments show that the method can correctly identify the alphabet when a handwritten alphabet is available, but performance degrades when handwritten symbols are compared against printed alphabets, with handwriting style dominating shape similarity. These results highlight the importance of realistic handwritten reference alphabets.
dc.identifier.issn1736- 6305
dc.identifier.urihttps://hdl.handle.net/10062/122083
dc.language.isoen
dc.publisherTartu University Library
dc.relation.ispartofseriesNEALT Proceedings Series Number 61
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectCiphered handwritten documents
dc.subjectImage processing
dc.subjectAlphabet identification
dc.titleExploring the Automatic Alphabet Identification of Images of Handwritten Ciphers
dc.typeArticle

Failid

Originaal pakett

Nüüd näidatakse 1 - 1 1
Laen...
Pisipilt
Nimi:
HistoCrypt_2026_paper_28.pdf
Suurus:
1001.2 KB
Formaat:
Adobe Portable Document Format