Sciweavers

195 search results - page 33 / 39
» The structure of broad topics on the web
Sort
View
74
Voted
WWW
2007
ACM
15 years 10 months ago
A new suffix tree similarity measure for document clustering
In this paper, we propose a new similarity measure to compute the pairwise similarity of text-based documents based on suffix tree document model. By applying the new suffix tree ...
Hung Chim, Xiaotie Deng
86
Voted
TIME
2008
IEEE
15 years 4 months ago
Time Aware Mining of Itemsets
Frequent behavioural pattern mining is a very important topic of knowledge discovery, intended to extract correlations between items recorded in large databases or Web acces logs....
Bashar Saleh, Florent Masseglia
IJPRAI
2002
142views more  IJPRAI 2002»
14 years 9 months ago
Improving Encarta Search Engine Performance by Mining User Logs
We propose a data-mining approach that produces generalized query patterns (with generalized keywords) from the raw user logs of the Microsoft Encarta search engine (http://encart...
Charles X. Ling, Jianfeng Gao, Huajie Zhang, Weini...
77
Voted
KDD
2007
ACM
167views Data Mining» more  KDD 2007»
15 years 10 months ago
Generalized component analysis for text with heterogeneous attributes
We present a class of richly structured, undirected hidden variable models suitable for simultaneously modeling text along with other attributes encoded in different modalities. O...
Xuerui Wang, Chris Pal, Andrew McCallum
KDD
2003
ACM
142views Data Mining» more  KDD 2003»
15 years 10 months ago
Mining phenotypes and informative genes from gene expression data
Mining microarray gene expression data is an important research topic in bioinformatics with broad applications. While most of the previous studies focus on clustering either gene...
Chun Tang, Aidong Zhang, Jian Pei