Sciweavers

180 search results - page 14 / 36
» Iterated Document Content Classification
ICDM
2009
IEEE
Data Mining
15 years 18 days ago
Towards a Universal Text Classifier: Transfer Learning Using Encyclopedic Knowledge
Document classification is a key task for many text mining applications. However, traditional text classification requires labeled data to construct reliable and accurate classifie...
Pu Wang, Carlotta Domeniconi
JCDL
2005
ACM
Education
15 years 8 months ago
Downloading textual hidden web content through keyword queries
An ever-increasing amount of information on the Web today is available only through search interfaces: the users have to type in a set of keywords in a search form in order to acc...
Alexandros Ntoulas, Petros Zerfos, Junghoo Cho
ICML
2005
IEEE
16 years 3 months ago
A model for handling approximate, noisy or incomplete labeling in text classification
We introduce a Bayesian model, BayesANIL, that is capable of estimating uncertainties associated with the labeling process. Given a labeled or partially labeled training corpus of...
Ganesh Ramakrishnan, Krishna Prasad Chitrapura, Ra...
EMNLP
2008
15 years 4 months ago
HTM: A Topic Model for Hypertexts
Previously, topic models such as PLSI (Probabilistic Latent Semantic Indexing) and LDA (Latent Dirichlet Allocation) were developed for modeling the contents of plain texts. Recent...
Congkai Sun, Bin Gao, Zhenfu Cao, Hang Li
SIGIR
2008
ACM
15 years 2 months ago
Classifiers without borders: incorporating fielded text from neighboring web pages
Accurate web page classification often depends crucially on information gained from neighboring pages in the local web graph. Prior work has exploited the class labels of nearby p...
Xiaoguang Qi, Brian D. Davison