Sciweavers

AUSDM
2007
Springer

Using Corpus Analysis to Inform Research into Opinion Detection in Blogs

13 years 9 months ago
Using Corpus Analysis to Inform Research into Opinion Detection in Blogs
Opinion detection research relies on labeled documents for training data, either by assumptions based on the document’s origin or by using human assessors to categorise the documents. In recent years, blogs have become a source for opinion identification research (TREC Blog06). This study analyses the part-of-speech proportion and the words used within various corpora, determining key differences and similarities useful when preparing for opinion identification research. The resulting comparisons between the characteristics of the various corpora is detailed and discussed. In particular, opinion-bearing and nonopinion Blog06 documents were found to display a high level of similarity, indicating that blog documents assessed at the document level cannot be used as training data in opinion identification research.
Deanna J. Osman, John Yearwood, Peter Vamplew
Added 07 Jun 2010
Updated 07 Jun 2010
Type Conference
Year 2007
Where AUSDM
Authors Deanna J. Osman, John Yearwood, Peter Vamplew
Comments (0)