Transforming Auto-Encoders

14 years 11 months ago

Download www.cs.toronto.edu

The artiﬁcial neural networks that are used to recognize shapes typically use one or more layers of learned feature detectors that produce scalar outputs. By contrast, the computer vision community uses complicated, hand-engineered features, like SIFT [6], that produce a whole vector of outputs including an explicit representation of the pose of the feature. We show how neural networks can be used to learn features that output a whole vector of instantiation parameters and we argue that this is a much more promising way of dealing with variations in position, orientation, scale and lighting than the methods currently employed in the neural networks community. It is also more promising than the handengineered features currently used in computer vision because it provides an eﬃcient way of adapting the features to the domain.

Geoffrey E. Hinton, Alex Krizhevsky, Sida D. Wang

Real-time Traffic

Computer Vision Community | Explicit Representation | Feature Detectors | ICANN 2011 | Neural Networks |

claim paper

Post Info
More Details (n/a)

Added	29 Aug 2011
Updated	29 Aug 2011
Type	Journal
Year	2011
Where	ICANN
Authors	Geoffrey E. Hinton, Alex Krizhevsky, Sida D. Wang

Comments (0)

Sciweavers

Transforming Auto-Encoders

Computer Vision Community | Explicit Representation | Feature Detectors | ICANN 2011 | Neural Networks |

Explore & Download

Productivity Tools

Sciweavers