Multilingual sentiment analysis in social media

  1. SAN VICENTE RONCAL, IÑAKI
Dirigée par:
  1. Rodrigo Agerri Gascón Directeur
  2. Germán Rigau Claramunt Directeur

Université de défendre: Universidad del País Vasco - Euskal Herriko Unibertsitatea

Fecha de defensa: 11 mars 2019

Jury:
  1. Arantza Díaz de Ilarraza Sánchez President
  2. Núria Bel Rafecas Secrétaire
  3. Horacio Rodríguez Hontoria Rapporteur

Type: Thèses

Teseo: 149228 DIALNET lock_openADDI editor

Résumé

This thesis addresses the task of analysing sentiment in messages coming from social media. The ultimate goal was to develop a Sentiment Analysis system for Basque. However, because of the socio-linguistic reality of the Basque language a tool providing only analysis for Basque would not be enough for a real world application. Thus, we set out to develop a multilingual system, including Basque, English, French and Spanish.The thesis addresses the following challenges to build such a system:- Analysing methods for creating Sentiment lexicons, suitable for less resourced languages.- Analysis of social media (specifically Twitter): Tweets pose several challenges in order to understand and extract opinions from such messages. Language identification and microtext normalization are addressed.- Research the state of the art in polarity classification, and develop a supervised classifier that is tested against well known social media benchmarks.- Develop a social media monitor capable of analysing sentiment with respect to specific events, products or organizations.