Sciweavers

6388 search results - page 201 / 1278
» High Performance Data Mining
Sort
View
148
Voted
PVLDB
2008
107views more  PVLDB 2008»
15 years 4 months ago
Finding relevant patterns in bursty sequences
Sequence data is ubiquitous and finding frequent sequences in a large database is one of the most common problems when analyzing sequence data. Unfortunately many sources of seque...
Alexander Lachmann, Mirek Riedewald
126
Voted
ICML
2008
IEEE
16 years 5 months ago
An empirical evaluation of supervised learning in high dimensions
In this paper we perform an empirical evaluation of supervised learning on highdimensional data. We evaluate performance on three metrics: accuracy, AUC, and squared loss and stud...
Rich Caruana, Nikolaos Karampatziakis, Ainur Yesse...
LCPC
2005
Springer
15 years 10 months ago
Applying Data Copy to Improve Memory Performance of General Array Computations
Abstract. Data copy is an important compiler optimization which dynamically rearranges the layout of arrays by copying their elements into local buffers. Traditionally, array copy...
Qing Yi
HAIS
2009
Springer
15 years 9 months ago
Unsupervised Feature Selection in High Dimensional Spaces and Uncertainty
Developing models and methods to manage data vagueness is a current effervescent research field. Some work has been done with supervised problems but unsupervised problems and unce...
José Ramón Villar, María del ...
262
Voted
SIGMOD
2006
ACM
219views Database» more  SIGMOD 2006»
16 years 4 months ago
Modeling skew in data streams
Data stream applications have made use of statistical summaries to reason about the data using nonparametric tools such as histograms, heavy hitters, and join sizes. However, rela...
Flip Korn, S. Muthukrishnan, Yihua Wu