Sciweavers

Search results: Numbers in Multi-relational Data Mining
ADMA
2007
Springer
Topic Extraction with AGAPE
This paper uses an optimization approach to address the problem of conceptual clustering. The aim of AGAPE, which is based on the tabu-search meta-heuristic using split, merge and ...
Julien Velcin, Jean-Gabriel Ganascia
ICDM
2006
IEEE
Plagiarism Detection in arXiv
We describe a large-scale application of methods for finding plagiarism and self-plagiarism in research document collections. The methods are applied to a collection of 284,834 d...
Daria Sorokina, Johannes Gehrke, Simeon Warner, Pa...
ADMA
2005
Springer
An LZ78 Based String Kernel
We develop the notion of normalized information distance (NID) [7] into a kernel distance suitable for use with a Support Vector Machine classifier, and demonstrate its use for an...
Ming Li, Ronan Sleep
SDM
2008
SIAM
Deterministic Latent Variable Models and Their Pitfalls
We derive a number of well-known deterministic latent variable models such as PCA, ICA, EPCA, NMF and PLSA as variational EM approximations with point posteriors. We show that the...
Max Welling, Chaitanya Chemudugunta, Nathan Sutter
SDM
2007
SIAM
Higher Order Orthogonal Iteration of Tensors (HOOI) and its Relation to PCA and GLRAM
This paper presents a unified view of a number of dimension reduction techniques under the common framework of tensors. Specifically, it is established that PCA, and the recentl...
Bernard N. Sheehan, Yousef Saad