Sciweavers

2827 search results - page 383 / 566
» Marking Text Documents
Sort
View
ICDM
2007
IEEE
129views Data Mining» more  ICDM 2007»
16 years 13 days ago
Semi-supervised Clustering Using Bayesian Regularization
Text clustering is most commonly treated as a fully automated task without user supervision. However, we can improve clustering performance using supervision in the form of pairwi...
Zuobing Xu, Ram Akella, Mike Ching, Renjie Tang
SAC
2006
ACM
16 years 2 days ago
Exploiting partial decision trees for feature subset selection in e-mail categorization
In this paper we propose PARTfs which adopts a supervised machine learning algorithm, namely partial decision trees, as a method for feature subset selection. In particular, it is...
Helmut Berger, Dieter Merkl, Michael Dittenbach
SIGIR
2005
ACM
15 years 11 months ago
Indexing emails and email threads for retrieval
Electronic mail poses a number of unusual challenges for the design of information retrieval systems and test collections, including informal expression, conversational structure,...
Yejun Wu, Douglas W. Oard
COSIT
2005
Springer
125views GIS» more  COSIT 2005»
15 years 11 months ago
Landmark Extraction: A Web Mining Approach
Landmarks play crucial roles in human geographic knowledge. There has been much work focusing on the extraction of landmarks from geographic information systems (GIS) or 3D city mo...
Taro Tezuka, Katsumi Tanaka
SIGIR
2004
ACM
15 years 11 months ago
Focused named entity recognition using machine learning
In this paper we study the problem of finding most topical named entities among all entities in a document, which we refer to as focused named entity recognition. We show that th...
Li Zhang, Yue Pan, Tong Zhang