Building a General Purpose Cross-Domain Sentiment Mining Model

16 years 2 months ago

Download www.cs.indiana.edu

Building a model using machine learning that can classify the sentiment of natural language text often requires an extensive set of labeled training data from the same domain as the target text. Gathering and labeling new datasets whenever a model is needed for a new domain is time-consuming and difﬁcult, especially if a dataset with numeric ratings is not available. In this paper we consider the problem of building models that have a high sentiment classiﬁcation accuracy without the aid of a labeled dataset from the target domain. We show that an adjusted form of cosine similarity between domain lexicons can be used to predict which models will be effective in a new target domain. We also show that ensembles of existing domain models can be used to achieve a classiﬁcation accuracy that approaches that of models trained on data from the target domain.

Matthew Whitehead, Larry Yaeger

Real-time Traffic

Classiﬁcation Accuracy | Computer Science | CSIE 2009 | Sentiment Classiﬁcation Accuracy | Target Domain |

claim paper

Post Info
More Details (n/a)

Added	20 May 2010
Updated	20 May 2010
Type	Conference
Year	2009
Where	CSIE
Authors	Matthew Whitehead, Larry Yaeger

Comments (0)

Sciweavers

Building a General Purpose Cross-Domain Sentiment Mining Model

Classiﬁcation Accuracy | Computer Science | CSIE 2009 | Sentiment Classiﬁcation Accuracy | Target Domain |

Explore & Download

Productivity Tools

Sciweavers