Sciweavers

1553 search results - page 172 / 311
» Learning from Multiple Sources of Inaccurate Data
Sort
View
130
Voted
ICMLA
2008
15 years 5 months ago
Mapping Uncharted Waters: Exploratory Analysis, Visualization, and Clustering of Oceanographic Data
In this paper we describe an interdisciplinary collaboration between researchers in machine learning and oceanography. The collaboration was formed to study the problem of open oc...
Joshua M. Lewis, Pincelli M. Hull, Kilian Q. Weinb...
138
Voted
ACMSE
2006
ACM
15 years 9 months ago
Automatic quality assessment of Affymetrix GeneChip data
Computing reliable gene expression levels from microarray experiments is a sophisticated process with many potential pitfalls. Quality control is one of the most important steps i...
Steffen Heber, Beate Sick
139
Voted
WWW
2006
ACM
16 years 4 months ago
Interactive wrapper generation with minimal user effort
While much of the data on the web is unstructured in nature, there is also a significant amount of embedded structured data, such as product information on e-commerce sites or sto...
Utku Irmak, Torsten Suel
ICASSP
2011
IEEE
14 years 7 months ago
Toward text message normalization: Modeling abbreviation generation
This paper describes a text normalization system for deletion-based abbreviations in informal text. We propose using statistical classifiers to learn the probability of deleting ...
Deana Pennell, Yang Liu
126
Voted
CVPR
2008
IEEE
16 years 5 months ago
Clustering and dimensionality reduction on Riemannian manifolds
We propose a novel algorithm for clustering data sampled from multiple submanifolds of a Riemannian manifold. First, we learn a representation of the data using generalizations of...
Alvina Goh, René Vidal