®®®® SIIA Público

Título del libro:
Título del capítulo: Identification of the minimal set of attributes that maximizes the information towards the author of a political discourse: The case of the candidates in the Mexican presidential elections

Autores UNAM:
JOSE ANTONIO NEME CASTILLO; SERGIO HERNANDEZ LOPEZ; VICENTE CARRION VELAZQUEZ;
Autores externos:

Idioma:

Año de publicación:
2013
Palabras clave:

Character recognition; Genetic algorithms; Learning algorithms; Learning systems; Natural language processing systems; Time series; Authorship attribution; Machine learning communities; Mutual informations; NAtural language processing; Novel methodology; Political discourse; Presidential election; Time series processing; Artificial intelligence


Resumen:

Authorship attribution has attracted the attention of the natural language processing and machine learning communities in the past few years. Here we are interested in finding a general measure of the style followed in the texts from the three main candidates in the Mexican presidential elections of 2012. We analyzed dozens of texts (discourses) from the three authors. We applied tools from the time series processing field and machine learning community in order to identify the overall attributes that define the writing style of the three authors. Several attributes and time series were extracted from each text. A novel methodology, based in mutual information, was applied on those time series and attributes to explore the relevance of each attribute to linearly separate the texts accordingly to their authorship. We show that less than 20 variables are enough to identify, by means of a linear recognizer, the authorship of a text from within one of the three considered authors. © 2013 Springer-Verlag.


Entidades citadas de la UNAM: