Sciweavers

1577 search results - page 193 / 316
» Data Mining: Machine Learning, Statistics, and Databases
Sort
View
CSB
2005
IEEE
189views Bioinformatics» more  CSB 2005»
15 years 7 months ago
Learning Yeast Gene Functions from Heterogeneous Sources of Data Using Hybrid Weighted Bayesian Networks
We developed a machine learning system for determining gene functions from heterogeneous sources of data sets using a Weighted Naive Bayesian Network (WNB). The knowledge of gene ...
Xutao Deng, Huimin Geng, Hesham H. Ali
KDD
2009
ACM
180views Data Mining» more  KDD 2009»
16 years 2 months ago
Consensus group stable feature selection
Stability is an important yet under-addressed issue in feature selection from high-dimensional and small sample data. In this paper, we show that stability of feature selection ha...
Steven Loscalzo, Lei Yu, Chris H. Q. Ding
SIGIR
2010
ACM
15 years 5 months ago
Self-taught hashing for fast similarity search
The ability of fast similarity search at large scale is of great importance to many Information Retrieval (IR) applications. A promising way to accelerate similarity search is sem...
Dell Zhang, Jun Wang, Deng Cai, Jinsong Lu
CIKM
2010
Springer
15 years 8 days ago
Partial drift detection using a rule induction framework
The major challenge in mining data streams is the issue of concept drift, the tendency of the underlying data generation process to change over time. In this paper, we propose a g...
Damon Sotoudeh, Aijun An
ICDE
2007
IEEE
167views Database» more  ICDE 2007»
15 years 8 months ago
Load Shedding for Window Joins on Multiple Data Streams
We consider the problem of semantic load shedding for continuous queries containing window joins on multiple data streams and propose a robust approach that is effective with the ...
Yan-Nei Law, Carlo Zaniolo