®®®® SIIA Público

Título del libro:
Título del capítulo: A Symbolic Algorithm for the Unification of Nawatl Word Spellings

Autores UNAM:
GERARDO EUGENIO SIERRA MARTINEZ;
Autores externos:

Idioma:

Año de publicación:
2026
Palabras clave:

Corpus; Linguistic rules; Nahuatl; Semantic similarity; Symbolic algorithms; Symbolic modeling; Symbolic NLP algorithm; Text document; Unification algorithms; Unified spelling


Resumen:

© The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.In this paper, we describe a symbolic model for the automatic orthographic unification of Nawatl text documents. Our model is based on algorithms that we have previously used to analyze sentences in Nawatl, and on the corpus called p-yalli, consisting of texts in several Nawatl orthographies. Our automatic unification algorithm implements linguistic rules in symbolic regular expressions. We also present a manual evaluation protocol that we have proposed and implemented to assess the quality of the unified sentences generated by our algorithm, by testing in a sentence semantic task. We have obtained encouraging results from the evaluators for most of the desired features of our artificially unified sentences.


Entidades citadas de la UNAM: