Sciweavers

1584 search results - page 273 / 317
» Data selection for support vector machine classifiers
Sort
View
BMCBI
2006
126views more  BMCBI 2006»
14 years 9 months ago
A Regression-based K nearest neighbor algorithm for gene function prediction from heterogeneous data
Background: As a variety of functional genomic and proteomic techniques become available, there is an increasing need for functional analysis methodologies that integrate heteroge...
Zizhen Yao, Walter L. Ruzzo
KDD
2003
ACM
214views Data Mining» more  KDD 2003»
15 years 10 months ago
Adaptive duplicate detection using learnable string similarity measures
The problem of identifying approximately duplicate records in databases is an essential step for data cleaning and data integration processes. Most existing approaches have relied...
Mikhail Bilenko, Raymond J. Mooney
ICAS
2006
IEEE
139views Robotics» more  ICAS 2006»
15 years 3 months ago
Predicting Resource Demand in Dynamic Utility Computing Environments
— We target the problem of predicting resource usage in situations where the modeling data is scarce, non-stationary, or expensive to obtain. This scenario occurs frequently in c...
Artur Andrzejak, Sven Graupner, Stefan Plantikow
ICDM
2006
IEEE
164views Data Mining» more  ICDM 2006»
15 years 3 months ago
Unsupervised Learning of Tree Alignment Models for Information Extraction
We propose an algorithm for extracting fields from HTML search results. The output of the algorithm is a database table– a data structure that better lends itself to high-level...
Philip Zigoris, Damian Eads, Yi Zhang
JMLR
2010
121views more  JMLR 2010»
14 years 4 months ago
Sparse Semi-supervised Learning Using Conjugate Functions
In this paper, we propose a general framework for sparse semi-supervised learning, which concerns using a small portion of unlabeled data and a few labeled data to represent targe...
Shiliang Sun, John Shawe-Taylor