Repository logo
 
Publication

Use of discourse knowledge to improve lexicon-based sentiment analysis

datacite.subject.fosHumanidades::Outras Humanidadespt_PT
dc.contributor.advisorOräsan, Constantin
dc.contributor.advisorSilva, M ario Jorge Gaspar da
dc.contributor.authorFilho, Pedro Balage
dc.date.accessioned2018-04-12T13:20:17Z
dc.date.available2018-04-12T13:20:17Z
dc.date.issued2012
dc.date.submitted2012
dc.descriptionDissertação de Mestrado, Processamento de Linguagem Natural e Indústria da Língua, Faculdade de Ciências Humanas e Sociais, Universidade do Algarve. School of Law, Social Sciences and Communications, University of Wolverhampton, 2012
dc.description.abstractSentiment Analysis deals with the computational treatment of sentiment in texts. The recent interest for sentiment analysis has grown due the popularity of internet and the increase of user-generated contents, such as blogs, social networks and reviews websites. This work understands sentiment analysis as a classi cation problem. In this problem, a text can be classi ed as positive or negative. Sentiment classi ers can be distinguished by two main approaches: machine learning and lexicon-based. The machine learning approach uses a corpus to automatically learn the best classi cation features. The lexicon-based approach uses a previously computed dictionary with the sentiment lexicon. Discourse is a linguistic level of analysis where the author represents ideas and links concepts in a rational chain of thoughts. One important representation of discourse is the Rhetorical Structure Theory (RST). This theory organizes the discourse in 26 relations that hierarchically represent the text discourse. This objective of this work is to use discourse knowledge to improve a lexicon-based sentiment classi er. To achieve this goal it proposes the SO-RST, a lexicon-based algorithm that weights portions of text under particular RST relations distinctly. Two experiments are reported. The rst experiment veri es if the RST improves sentiment classi cation. It also shows the discourse relations which are most important in the process. The second experiment incorporates discourse markers in the algorithm in order to eliminate the necessity of a RST annotated corpus. It uses the weights learned in the rst experiment to perform the sentiment classi cation. The results obtained showed which RST relations most help the lexicon-based classi er to achieve a better accuracy. The discourse markers introduced in the algorithm showed some directions to follow and the necessary steps to better study this technique.pt_PT
dc.identifier.urihttp://hdl.handle.net/10400.1/10610
dc.language.isoengpt_PT
dc.subjectAnálise de sentimentospt_PT
dc.subjectAnálise de sentimentos em léxicopt_PT
dc.subjectDiscursopt_PT
dc.subjectTeoria da estrutura retóricapt_PT
dc.titleUse of discourse knowledge to improve lexicon-based sentiment analysispt_PT
dc.typemaster thesis
dspace.entity.typePublication
rcaap.rightsrestrictedAccesspt_PT
rcaap.typemasterThesispt_PT
thesis.degree.disciplineProcessamento de Linguagem Natural e Indústria da Língua
thesis.degree.grantorUniversidade do Algarve. Faculdade de Ciências Humanas e Sociais
thesis.degree.levelMestre
thesis.degree.nameMestrado em Processamento de Linguagem Natural e Indústria da Línguapt_PT

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Master Thesis - Balage Filho.pdf
Size:
722.9 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
3.41 KB
Format:
Item-specific license agreed upon to submission
Description: