Integrating Background Knowledge Into Text Classification

10 years 7 months ago
Integrating Background Knowledge Into Text Classification
We present a description of three different algorithms that use background knowledge to improve text classifiers. One uses the background knowledge as an index into the set of training examples. The second method uses background knowledge to reexpress the training examples. The last method treats pieces of background knowledge as unlabeled examples, and actually classifies them. The choice of background knowledge affects each method's performance and we discuss which type of background knowledge is most useful for each specific method. 1 Using Background Knowledge Supervised learning algorithms rely on a corpus of labeled training examples to produce accurate automatic text classifiers. An insufficient number of training examples often results in learned models that are suboptimal when classifying previously unseen examples. Numerous different approaches have been taken to compensate for the lack of training examples. These include the use of unlabeled examples [Bennet and Demiri...
Sarah Zelikovitz, Haym Hirsh
Added 31 Oct 2010
Updated 31 Oct 2010
Type Conference
Year 2003
Authors Sarah Zelikovitz, Haym Hirsh
Comments (0)