Search Sciweavers | Sciweavers

3090 search results - page 590 / 618

» Document Processing with LinkIT

146

click to vote

KDD
2008
ACM

183views Data Mining» more KDD 2008»

De-duping URLs via rewrite rules

16 years 5 months ago

Download research.yahoo.com

A large fraction of the URLs on the web contain duplicate (or near-duplicate) content. De-duping URLs is an extremely important problem for search engines, since all the principal...

Anirban Dasgupta, Ravi Kumar, Amit Sasturkar

claim paper

Read More »

149

click to vote

KDD
2008
ACM

119views Data Mining» more KDD 2008»

SAIL: summation-based incremental learning for information-theoretic clustering

16 years 5 months ago

Download datamining.rutgers.edu

Information-theoretic clustering aims to exploit information theoretic measures as the clustering criteria. A common practice on this topic is so-called INFO-K-means, which perfor...

Junjie Wu, Hui Xiong, Jian Chen

claim paper

Read More »

132

click to vote

KDD
2006
ACM

109views Data Mining» more KDD 2006»

Extracting redundancy-aware top-k patterns

16 years 5 months ago

Download www.se.cuhk.edu.hk

Observed in many applications, there is a potential need of extracting a small set of frequent patterns having not only high significance but also low redundancy. The significance...

Dong Xin, Hong Cheng, Xifeng Yan, Jiawei Han

claim paper

Read More »

157

click to vote

KDD
2004
ACM

210views Data Mining» more KDD 2004»

Probabilistic author-topic models for information discovery

16 years 5 months ago

Download psiexp.ss.uci.edu

We propose a new unsupervised learning technique for extracting information from large text collections. We model documents as if they were generated by a two-stage stochastic pro...

Mark Steyvers, Padhraic Smyth, Michal Rosen-Zvi, T...

claim paper

Read More »

166

click to vote

KDD
2002
ACM

112views Data Mining» more KDD 2002»

From run-time behavior to usage scenarios: an interaction-pattern mining approach

16 years 5 months ago

Download www.lans.ece.utexas.edu

A key challenge facing IT organizations today is their evolution towards adopting e-business practices that gives rise to the need for reengineering their underlying software syst...

Mohammad El-Ramly, Eleni Stroulia, Paul G. Sorenso...

claim paper

Read More »

« Prev « First page 590 / 618 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers