Search Sciweavers | Sciweavers

KDD 2008, ACM

De-duping URLs via rewrite rules

A large fraction of the URLs on the web contain duplicate (or near-duplicate) content. De-duping URLs is an extremely important problem for search engines, since all the principal...

Anirban Dasgupta, Ravi Kumar, Amit Sasturkar

KDD 2008, ACM

Entity categorization over large document collections

Extracting entities (such as people, movies) from documents and identifying the categories (such as painter, writer) they belong to enable structured querying and data analysis ov...

Arnd Christian König, Rares Vernica, Venkates...

KDD 2004, ACM

Redundancy based feature selection for microarray data

In gene expression microarray data analysis, selecting a small number of discriminative genes from thousands of genes is an important problem for accurate classification of diseas...

Lei Yu, Huan Liu

KDD 2002, ACM

Topics in 0-1 data

Large 0-1 datasets arise in various applications, such as market basket analysis and information retrieval. We concentrate on the study of topic models, aiming at results which in...

Ella Bingham, Heikki Mannila, Jouni K. Seppän...

KDD 2002, ACM

On effective classification of strings with wavelets

In recent years, the technological advances in mapping genes have made it increasingly easy to store and use a wide variety of biological data. Such data are usually in the form o...

Charu C. Aggarwal
