Sciweavers

1204 search results - page 100 / 241
» Using Machine Learning Techniques for Stylometry
Sort
View
COLT
1998
Springer
15 years 5 months ago
Cross-Validation for Binary Classification by Real-Valued Functions: Theoretical Analysis
This paper concerns the use of real-valued functions for binary classification problems. Previous work in this area has concentrated on using as an error estimate the `resubstitut...
Martin Anthony, Sean B. Holden
ICML
2005
IEEE
16 years 2 months ago
Reducing overfitting in process model induction
In this paper, we review the paradigm of inductive process modeling, which uses background knowledge about possible component processes to construct quantitative models of dynamic...
Will Bridewell, Narges Bani Asadi, Pat Langley, Lj...
KDD
2003
ACM
214views Data Mining» more  KDD 2003»
16 years 1 months ago
Adaptive duplicate detection using learnable string similarity measures
The problem of identifying approximately duplicate records in databases is an essential step for data cleaning and data integration processes. Most existing approaches have relied...
Mikhail Bilenko, Raymond J. Mooney
ECML
1997
Springer
15 years 5 months ago
Constructing Intermediate Concepts by Decomposition of Real Functions
In learning from examples it is often useful to expand an attribute-vector representation by intermediate concepts. The usual advantage of such structuring of the learning problemi...
Janez Demsar, Blaz Zupan, Marko Bohanec, Ivan Brat...
ICML
2004
IEEE
16 years 2 months ago
Improving SVM accuracy by training on auxiliary data sources
The standard model of supervised learning assumes that training and test data are drawn from the same underlying distribution. This paper explores an application in which a second...
Pengcheng Wu, Thomas G. Dietterich