Home > Research > Publications & Outputs > Sentiment analysis with knowledge resource and ...
View graph of relations

Sentiment analysis with knowledge resource and NLP tools

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published
Close
<mark>Journal publication date</mark>2009
<mark>Journal</mark>The International Journal of Interdisciplinary Social Sciences
Issue number5
Volume4
Number of pages12
Pages (from-to)17-28
Publication StatusPublished
<mark>Original language</mark>English

Abstract

Automatic sentiment analysis is an important and challenging topic in Human Language Technology (HLT) and text mining, with several applications for social sciences. Over recent years, much effort has been devoted to this subject. Many published works on this subject employ various machine learning techniques. In our work, we investigate the feasibility of automatically identifying text sentiment orientation by combining knowledge resources and NLP techniques, as a complementary method to those based on machine learning. Our approach involves resources and tools including a subjectivity lexicon (Wilson et al., 2005), a set of NLP tools and weighting algorithms. We developed a sentiment analysis tool with a web demonstrator (http://text0.mib.man.ac.uk:8080/opminpackage/opinion_analysis). This tool has been used jointly with other text mining tools to support social scientists in frame analysis from newspaper articles. In this paper, we describe our system and report on our evaluation of the functionality of identifying sentiment orientation at the sentence and document levels. We used the Multi-Perspective Question Answering (MPQA) Corpus (Wiebe et al., 2005) and a collection of 2,000 manually classified film reviews (Pang and Lee, 2004) as the test data. As evaluation measure, we used f-score, a performance measure that combines precision and recall and ranges between 0 (worst performance) and 1 (best performance). In the evaluation, our sentiment analysis tool obtained encouraging results, producing f-scores ranging from 0.6012 to 0.8333 for sentence-level sentiment analysis and an average f-score of 0.7196 for document-level sentiment analysis.