Data Mining | Sciweavers

17

KDD
2008
ACM

142views Data Mining» more KDD 2008»

Weighted graphs and disconnected components: patterns and a generator

14 years 6 months ago

The vast majority of earlier work has focused on graphs which are both connected (typically by ignoring all but the giant connected component), and unweighted. Here we study numer...

Mary McGlohon, Leman Akoglu, Christos Faloutsos

claim paper

Read More »

19

click to vote

KDD
2008
ACM

172views Data Mining» more KDD 2008»

Structured metric learning for high dimensional problems

14 years 6 months ago

Download www.cs.utexas.edu

The success of popular algorithms such as k-means clustering or nearest neighbor searches depend on the assumption that the underlying distance functions reflect domain-specific n...

Jason V. Davis, Inderjit S. Dhillon

claim paper

Read More »

17

click to vote

KDD
2008
ACM

110views Data Mining» more KDD 2008»

Mining preferences from superior and inferior examples

14 years 6 months ago

Download www.cs.sfu.ca

Mining user preferences plays a critical role in many important applications such as customer relationship management (CRM), product and service recommendation, and marketing camp...

Bin Jiang, Jian Pei, Xuemin Lin, David W. Cheung, ...

claim paper

Read More »

13

click to vote

KDD
2008
ACM

186views Data Mining» more KDD 2008»

Scalable and near real-time burst detection from eCommerce queries

14 years 6 months ago

Download www.cse.ust.hk

In large scale online systems like Search, eCommerce, or social network applications, user queries represent an important dimension of activities that can be used to study the imp...

Nish Parikh, Neel Sundaresan

claim paper

Read More »

21

click to vote

KDD
2008
ACM

120views Data Mining» more KDD 2008»

Multi-class cost-sensitive boosting with p-norm loss functions

14 years 6 months ago

Download www.research.ibm.com

We propose a family of novel cost-sensitive boosting methods for multi-class classification by applying the theory of gradient boosting to p-norm based cost functionals. We establ...

Aurelie C. Lozano, Naoki Abe

claim paper

Read More »

17

click to vote

KDD
2008
ACM

152views Data Mining» more KDD 2008»

Automatic record linkage using seeded nearest neighbour and support vector machine classification

14 years 6 months ago

Download cs.anu.edu.au

Peter Christen

claim paper

Read More »

13

click to vote

KDD
2008
ACM

132views Data Mining» more KDD 2008»

Partitioned logistic regression for spam filtering

14 years 6 months ago

Download research.microsoft.com

Naive Bayes and logistic regression perform well in different regimes. While the former is a very simple generative model which is efficient to train and performs well empirically...

Ming-wei Chang, Wen-tau Yih, Christopher Meek

claim paper

Read More »

21

click to vote

KDD
2008
ACM

159views Data Mining» more KDD 2008»

Semi-supervised learning with data calibration for long-term time series forecasting

14 years 6 months ago

Download www.cse.msu.edu

Many time series prediction methods have focused on single step or short term prediction problems due to the inherent difficulty in controlling the propagation of errors from one ...

Haibin Cheng, Pang-Ning Tan

claim paper

Read More »

21

click to vote

KDD
2008
ACM

138views Data Mining» more KDD 2008»

Quantitative evaluation of approximate frequent pattern mining algorithms

14 years 6 months ago

Download www-users.cs.umn.edu

Traditional association mining algorithms use a strict definition of support that requires every item in a frequent itemset to occur in each supporting transaction. In real-life d...

Rohit Gupta, Gang Fang, Blayne Field, Michael Stei...

claim paper

Read More »

17

click to vote

KDD
2008
ACM

217views Data Mining» more KDD 2008»

Stream prediction using a generative model based on frequent episodes in event sequences

14 years 6 months ago

Download research.microsoft.com

This paper presents a new algorithm for sequence prediction over long categorical event streams. The input to the algorithm is a set of target event types whose occurrences we wis...

Srivatsan Laxman, Vikram Tankasali, Ryen W. White

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers