Sciweavers

ICDM
2010
IEEE
109views Data Mining» more  ICDM 2010»
13 years 2 months ago
Term Filtering with Bounded Error
Abstract--In this paper, we consider a novel problem referred to as term filtering with bounded error to reduce the term (feature) space by eliminating terms without (or with bound...
Zi Yang, Wei Li, Jie Tang, Juanzi Li
ECIR
1998
Springer
13 years 6 months ago
User-Chosen Phrases in Interactive Query Formulation for Information Retrieval
The impact of using phrases as content representation for documents and for queries has generally been accepted as a desirable feature in information retrieval systems because phr...
Alan F. Smeaton, Fergus Kelledy
IJCNN
2007
IEEE
13 years 11 months ago
Text Representations for Text Categorization: A Case Study in Biomedical Domain
— In vector space model (VSM), textual documents are represented as vectors in the term space. Therefore, there are two issues in this representation, i.e. (1) what should a term...
Man Lan, Chew Lim Tan, Jian Su, Hwee-Boon Low
WWW
2005
ACM
14 years 5 months ago
A comprehensive comparative study on term weighting schemes for text categorization with support vector machines
Term weighting scheme, which has been used to convert the documents as vectors in the term space, is a vital step in automatic text categorization. In this paper, we conducted com...
Man Lan, Chew Lim Tan, Hwee-Boon Low, Sam Yuan Sun...