Sciweavers

EMNLP
2007

Building Lexicon for Sentiment Analysis from Massive Collection of HTML Documents

13 years 6 months ago
Building Lexicon for Sentiment Analysis from Massive Collection of HTML Documents
Recognizing polarity requires a list of polar words and phrases. For the purpose of building such lexicon automatically, a lot of studies have investigated (semi-) unsupervised method of learning polarity of words and phrases. In this paper, we explore to use structural clues that can extract polar sentences from Japanese HTML documents, and build lexicon from the extracted polar sentences. The key idea is to develop the structural clues so that it achieves extremely high precision at the cost of recall. In order to compensate for the low recall, we used massive collection of HTML documents. Thus, we could prepare enough polar sentence corpus.
Nobuhiro Kaji, Masaru Kitsuregawa
Added 29 Oct 2010
Updated 29 Oct 2010
Type Conference
Year 2007
Where EMNLP
Authors Nobuhiro Kaji, Masaru Kitsuregawa
Comments (0)