Cross-domain learning methods have shown promising results by leveraging labeled patterns from auxiliary domains to learn a robust classifier for the target domain, which has a limi...
Dong Xu, Ivor Wai-Hung Tsang, Lixin Duan, Stephen ...
In this paper we present an algorithm for automatic extraction of textual elements, namely titles and full text, associated with news stories in news web pages. We propose a super...
In the traditional setting, text categorization is formulated as a concept learning problem where each instance is a single isolated document. However, this perspective is not appr...
With the rapid emergence and proliferation of the Internet and the trend of globalization, a tremendous number of textual documents written in different languages are electronically ac...
This research explores the idea of inducing domain-specific semantic class taggers using only a domain-specific text collection and seed words. The learning process begins by indu...