®®®® SIIA Público

Título del libro: 2015 4th International Conference On Informatics, Electronics And Vision, Iciev 2015
Título del capítulo: Comparison of a Modified Spanish phonetic, Soundex, and Phonex coding functions during data matching process

Autores UNAM:
MARIA DEL PILAR ANGELES; ADRIAN ESPINO GAMEZ; JONATHAN EUMIR GIL MONCADA;
Autores externos:

Idioma:
Inglés
Año de publicación:
2015
Palabras clave:

data matching; de-duplication; record linkage


Resumen:

The present paper is aimed to help native spanish speakers to identify an open and effective spanish encoding function during data matching process. We present the implementation and enhancement of the encoding algorithm Spanish Phonetic Soundex [1]. We have carried out an evaluation of data matching considering Spanish Phonetic Soundex, Soundex [2], [3] and Phonex [4] in terms of precision-recall and f-measure. As far as we know, such comparison against these phonetic functions has not been presented before. We suggest spanish speaker users a Modified Spanish Phonetic Soundex function, that has a better performance in terms of precision, f-measure and similarity values derived from the encoding phase than the common phonetic coding functions utilized until now.


Entidades citadas de la UNAM: