Sciweavers

2383 search results - page 316 / 477
» Finding Representative Set from Massive Data
Sort
View
KDD
2008
ACM
128views Data Mining» more  KDD 2008»
16 years 2 months ago
Scaling up text classification for large file systems
: We combine the speed and scalability of information retrieval with the generally superior classification accuracy offered by machine learning, yielding a two-phase text classifie...
George Forman, Shyamsundar Rajaram
RECOMB
2008
Springer
16 years 2 months ago
Accounting for Non-genetic Factors Improves the Power of eQTL Studies
Abstract. The recent availability of large scale data sets profiling single nucleotide polymorphisms (SNPs) and gene expression across different human populations, has directed muc...
Oliver Stegle, Anitha Kannan, Richard Durbin, John...
VLDB
2007
ACM
121views Database» more  VLDB 2007»
16 years 2 months ago
Ranked Subsequence Matching in Time-Series Databases
Existing work on similar sequence matching has focused on either whole matching or range subsequence matching. In this paper, we present novel methods for ranked subsequence match...
Wook-Shin Han, Jinsoo Lee, Yang-Sae Moon, Haifeng ...
KDD
1997
ACM
78views Data Mining» more  KDD 1997»
15 years 6 months ago
Mining Generalized Term Associations: Count Propagation Algorithm
We presenthere an approachand algorithm for mining generalizedterm associations.The problem is to find co-occurrencefrequenciesof terms, given a collection of documents eachwith r...
Jonghyun Kahng, Wen-Hsiang Kevin Liao, Dennis McLe...
COMAD
2008
15 years 3 months ago
Concurrency Control in Distributed MRA Index Structure
Answering aggregate queries like sum, count, min, max over regions containing moving objects is often needed for virtual world applications, real-time monitoring systems, etc. Sin...
Neha Singh, S. Sudarshan