KDD 1998 | Sciweavers

We apply a well-known Bayesian probabilistic model to textual information retrieval: the classification of documents based on their relevance to a query. This model was previously...

Ernest P. Chan, Santiago Garcia, Salim Roukos


KDD
1998
ACM


Blurring the Distinction between Command and Data in Scientific KDD

13 years 9 months ago

Download www.aaai.org

We have been working on two different KDD systems for scientific data. One system involves comparative genomics, where the database contains more than 60,000 plant gene and protei...

John V. Carlis, Elizabeth Shoop, Scott Krieger


13 years 9 months ago

Similarity of Attributes by External Probes

Download ranger.uta.edu

In data mining, similarity or distance between attributes is one of the central notions. Such a notion can be used to build attribute hierarchies etc. Similarity metrics can be us...

Gautam Das, Heikki Mannila, Pirjo Ronkainen


Rule Discovery from Time Series

13 years 9 months ago

Download reference.kfupm.edu.sa

We consider the problem of finding rules relating patterns in a time series to other patterns in that series, or patterns in one series to patterns in another series. A simple examp...

Gautam Das, King-Ip Lin, Heikki Mannila, Gopal Ren...


Giga-Mining

13 years 9 months ago

Download www.aaai.org

We describe an industrial-strength data mining application in telecommunications. The application requires building a short (7 byte) profile for all telephone numbers seen on a large t...

Corinna Cortes, Daryl Pregibon


Joins that Generalize: Text Classification Using WHIRL

13 years 9 months ago

Download www.aaai.org

WHIRL is an extension of relational databases that can perform "soft joins" based on the similarity of textual identifiers; these soft joins extend the traditional operation of j...

William W. Cohen, Haym Hirsh


Scaling Clustering Algorithms to Large Databases

13 years 9 months ago

Download www.aaai.org

Practical clustering algorithms require multiple data scans to achieve convergence. For large databases, these scans become prohibitively expensive. We present a scalable clusteri...

Paul S. Bradley, Usama M. Fayyad, Cory Reina


Direct Marketing Response Models Using Genetic Algorithms

13 years 9 months ago

Download www.aaai.org

Direct marketing response models seek to identify individuals most likely to respond to marketing solicitations. Such models are commonly evaluated on classification accuracy and so...

Siddhartha Bhattacharyya


CLOUDS: A Decision Tree Classifier for Large Datasets

13 years 9 months ago