Sciweavers

ADCS
2004
13 years 5 months ago
Co-Training on Textual Documents with a Single Natural Feature Set
Co-training is a semi-supervised technique that allows classifiers to learn with fewer labelled documents by taking advantage of the more abundant unclassified documents. However, ...
Jason Chan, Irena Koprinska, Josiah Poon
CIKM
2006
Springer
13 years 8 months ago
Knowing a web page by the company it keeps
Web page classification is important to many tasks in information retrieval and web mining. However, applying traditional textual classifiers on web data often produces unsatisfyi...
Xiaoguang Qi, Brian D. Davison
SIGIR
2004
ACM
13 years 9 months ago
Effectiveness of web page classification on finding list answers
List question answering (QA) offers a unique challenge in effectively and efficiently locating a complete set of distinct answers from huge corpora or the Web. In TREC-12, the med...
Hui Yang, Tat-Seng Chua
CIKM
2005
Springer
13 years 10 months ago
Fast webpage classification using URL features
We demonstrate the usefulness of the uniform resource locator (URL) alone in performing web page classification. This approach is magnitudes faster than typical web page classific...
Min-Yen Kan, Hoang Oanh Nguyen Thi
CIKM
2009
Springer
13 years 11 months ago
Improving web page classification by label-propagation over click graphs
In this paper, we present a semi-supervised learning method for web page classification, leveraging click logs to augment training data by propagating class labels to unlabeled si...
Soo-Min Kim, Patrick Pantel, Lei Duan, Scott Gaffn...
KDD
2002
ACM
140views Data Mining» more  KDD 2002»
14 years 4 months ago
PEBL: positive example based learning for Web page classification using SVM
Hwanjo Yu, Jiawei Han, Kevin Chen-Chuan Chang
WWW
2006
ACM
14 years 5 months ago
A comparison of implicit and explicit links for web page classification
It is well known that Web-page classification can be enhanced by using hyperlinks that provide linkages between Web pages. However, in the Web space, hyperlinks are usually sparse...
Dou Shen, Jian-Tao Sun, Qiang Yang, Zheng Chen