In automated text categorization, given a small number of labeled documents, it is very challenging, if not impossible, to build a reliable classifier that is able to achieve high...
Zenglin Xu, Rong Jin, Kaizhu Huang, Michael R. Lyu...
Recent techniques show the benefits of attaching community-generated knowledge to artifacts in an information space and presenting it to subsequent readers. We argue that such know...
Recognition of object categories from their images is extremely challenging due to the large intra-class variations, and variations in pose, illumination and scale, in addition to...
Two major stages in language identification systems can be identified: the language modeling stage, where the distinctive features of languages are determined and stored in...
In many important text classification problems, acquiring class labels for training documents is costly, while gathering large quantities of unlabeled data is cheap. This paper sh...
Kamal Nigam, Andrew McCallum, Sebastian Thrun, Tom...