Sciweavers

93 search results - page 3 / 19
» Reporting bias when using real data sets to analyze classifi...
Sort
View
ICDE
2005
IEEE
106views Database» more  ICDE 2005»
14 years 7 months ago
Effective Computation of Biased Quantiles over Data Streams
Skewis prevalentin manydata sourcessuchas IP traffic streams. To continually summarize the distribution of such data, a highbiased set of quantiles (e.g., 50th, 90th and 99th perc...
Graham Cormode, Flip Korn, S. Muthukrishnan, Dives...
ICPR
2002
IEEE
14 years 6 months ago
Adaptive Kernel Metric Nearest Neighbor Classification
Nearest neighbor classification assumes locally constant class conditional probabilities. This assumption becomes invalid in high dimensions due to the curse-ofdimensionality. Sev...
Jing Peng, Douglas R. Heisterkamp, H. K. Dai
PAKDD
2000
ACM
128views Data Mining» more  PAKDD 2000»
13 years 9 months ago
A Comparative Study of Classification Based Personal E-mail Filtering
This paper addresses personal E-mail filtering by casting it in the framework of text classification. Modeled as semi-structured documents, Email messages consist of a set of field...
Yanlei Diao, Hongjun Lu, Dekai Wu
CVPR
2000
IEEE
13 years 9 months ago
Adaptive Metric nearest Neighbor Classification
Nearest neighbor classification assumes locally constant class conditional probabilities. This assumption becomes invalid in high dimensions with finite samples due to the curse o...
Carlotta Domeniconi, Dimitrios Gunopulos, Jing Pen...
CHI
1994
ACM
13 years 10 months ago
Using aggregation and dynamic queries for exploring large data sets
When working with large data sets, users perform three primary types of activities: data manipulation, data analysis, and data visualization. The data manipulation process involve...
Jade Goldstein, Steven F. Roth