Open-access Use of Gramatical Category for Sentiment Identification

ABSTRACT

This research analyzes the impact of the filtering of word n-grams, using their grammatical category, on the identification of sentiment based on text comments from social networks in Spanish. The impact of filtering n-grams containing adjectives, adverbs and interjections was investigated. It was determined that it is possible to reduce the volume of processed data and at the same time achieve an improvement of up to 30 % in the accuracy when classifying an annotated corpus of test comments separating those that contain sentiment from those that do not.

Key Words: Natural Language Processing; Part of Speech; Sentiment Analysis

location_on
Universidad de Costa Rica Universidad de Costa Rica, San José, San José, CR, 2060, 2511-5107, 2511 8395 - E-mail: kanina@ucr.ac.cr
rss_feed Acompanhe os números deste periódico no seu leitor de RSS
Acessibilidade / Reportar erro