Sciweavers

1427 search results - page 208 / 286
» Learning with Annotation Noise
Sort
View
ICDAR
2007
IEEE
15 years 1 months ago
Identification of Latin-Based Languages through Character Stroke Categorization
This paper presents a language identification technique that detects Latin-based languages of imaged documents without OCR. The proposed technique detects languages through the wo...
S. J. Lu, L. Li, Chew Lim Tan
NIPS
2007
14 years 11 months ago
DIFFRAC: a discriminative and flexible framework for clustering
We present a novel linear clustering framework (DIFFRAC) which relies on a linear discriminative cost function and a convex relaxation of a combinatorial optimization problem. The...
Francis Bach, Zaïd Harchaoui
RIAO
2007
14 years 11 months ago
Using the Knowledge of Object Colors to Segment Images and Improve Web Image Search
With web image search engines, we face a situation where the results are very noisy, and when we ask for a specific object, we are not ensured that this object is contained in all...
Christophe Millet, Isabelle Bloch
FLAIRS
2006
14 years 11 months ago
Using Validation Sets to Avoid Overfitting in AdaBoost
AdaBoost is a well known, effective technique for increasing the accuracy of learning algorithms. However, it has the potential to overfit the training set because its objective i...
Tom Bylander, Lisa Tate
TREC
2003
14 years 11 months ago
Experiments in TREC 2003 Genomics Track at NTT
500,000 PubMed abstracts. However, less than 50 documents are relevant for most queries. Applying scoring to all 500,000 abstracts would create a lot of noise. In the first step, ...
Hirotoshi Taira, Tomonori Izumitani, Tsutomu Hirao...