Sciweavers

106 search results - page 3 / 22
» Document Representation and Dimension Reduction for Text Clu...
JCB
2007
13 years 5 months ago
Clustered Sequence Representation for Fast Homology Search
We present a novel approach to managing redundancy in sequence databanks such as GenBank. We store clusters of near-identical sequences as a representative union-sequence and a se...
Michael Cameron, Yaniv Bernstein, Hugh E. Williams
ECIR
2007
Springer
13 years 7 months ago
A Hierarchical Consensus Architecture for Robust Document Clustering
A major problem encountered by text clustering practitioners is the difficulty of determining a priori which is the optimal text representation and clustering technique f...
Xavier Sevillano, Germán Cobo, Francesc Al...
SIGIR
2005
ACM
13 years 11 months ago
Orthogonal locality preserving indexing
We consider the problem of document indexing and representation. Recently, Locality Preserving Indexing (LPI) was proposed for learning a compact document subspace. Different from...
Deng Cai, Xiaofei He
RIAO
2007
13 years 7 months ago
Comprehensible and Accurate Cluster Labels in Text Clustering
The purpose of text clustering in information retrieval is to discover groups of semantically related documents. Accurate and comprehensible cluster descriptions (labels) let the ...
Jerzy Stefanowski, Dawid Weiss
JAIR
2010
13 years 4 months ago
Which Clustering Do You Want? Inducing Your Ideal Clustering with Minimal Feedback
While traditional research on text clustering has largely focused on grouping documents by topic, it is conceivable that a user may want to cluster documents along other dimension...
Sajib Dasgupta, Vincent Ng