Sciweavers

KDD
2007
ACM
179views Data Mining» more  KDD 2007»
13 years 10 months ago
Mining statistically important equivalence classes and delta-discriminative emerging patterns
The support-confidence framework is the most common measure used in itemset mining algorithms, for its antimonotonicity that effectively simplifies the search lattice. This com...
Jinyan Li, Guimei Liu, Limsoon Wong
KDD
2007
ACM
124views Data Mining» more  KDD 2007»
13 years 10 months ago
Hierarchical mixture models: a probabilistic analysis
Mixture models form one of the most widely used classes of generative models for describing structured and clustered data. In this paper we develop a new approach for the analysis...
Mark Sandler
KDD
2007
ACM
127views Data Mining» more  KDD 2007»
13 years 10 months ago
Extracting relevant named entities for automated expense reimbursement
Guangyu Zhu, Timothy J. Bethea, Vikas Krishna
KDD
2007
ACM
210views Data Mining» more  KDD 2007»
13 years 10 months ago
Machine learning for stock selection
In this paper, we propose a new method called Prototype Ranking (PR) designed for the stock selection problem. PR takes into account the huge size of real-world stock data and app...
Robert J. Yan, Charles X. Ling
KDD
2007
ACM
145views Data Mining» more  KDD 2007»
13 years 10 months ago
Tracking multiple topics for finding interesting articles
Raymond K. Pon, Alfonso F. Cardenas, David Buttler...
KDD
2007
ACM
138views Data Mining» more  KDD 2007»
13 years 10 months ago
High-quantile modeling for customer wallet estimation and other applications
In this paper we discuss the important practical problem of customer wallet estimation, i.e., estimation of potential spending by customers (rather than their expected spending). ...
Claudia Perlich, Saharon Rosset, Richard D. Lawren...
KDD
2007
ACM
139views Data Mining» more  KDD 2007»
14 years 5 months ago
Looking for Great Ideas: Analyzing the Innovation Jam
We discuss the Innovation Jam that IBM carried out in 2006, with the objective of identifying innovative and promising "Big Ideas" through a moderated on-line discussion...
Wojciech Gryc, Mary E. Helander, Richard D. Lawren...
KDD
2007
ACM
143views Data Mining» more  KDD 2007»
14 years 5 months ago
Mining Research Communities in Bibliographical Data
Abstract. Extracting information from very large collections of structured, semistructured or even unstructured data can be a considerable challenge when much of the hidden informa...
Osmar R. Zaïane, Jiyang Chen, Randy Goebel
KDD
2007
ACM
198views Data Mining» more  KDD 2007»
14 years 5 months ago
Applying Link-Based Classification to Label Blogs
In analyzing data from social and communication networks, we encounter the problem of classifying objects where there is an explicit link structure amongst the objects. We study t...
Smriti Bhagat, Graham Cormode, Irina Rozenbaum
KDD
2007
ACM
244views Data Mining» more  KDD 2007»
14 years 5 months ago
A Recommender System Based on Local Random Walks and Spectral Methods
In this paper, we design recommender systems for weblogs based on the link structure among them. We propose algorithms based on refined random walks and spectral methods. First, w...
Zeinab Abbassi, Vahab S. Mirrokni