Search Sciweavers | Sciweavers

1227 search results - page 45 / 246

» Learning Rates for Q-Learning

159

click to vote

CVPR
2009
IEEE

371views Computer Vision» more CVPR 2009»

Unsupervised Learning for Graph Matching

16 years 10 months ago

Download www.ri.cmu.edu

Graph matching is an important problem in computer vision. It is used in 2D and 3D object matching and recognition. Despite its importance, there is little literature on learnin...

Marius Leordeanu, Martial Hebert

claim paper

Read More »

124

click to vote

ICML
2003
IEEE

137views Machine Learning» more ICML 2003»

The Use of the Ambiguity Decomposition in Neural Network Ensemble Learning Methods

16 years 4 months ago

Download www.hpl.hp.com

We analyze the formal grounding behind Negative Correlation (NC) Learning, an ensemble learning technique developed in the evolutionary computation literature. We show that by rem...

Gavin Brown, Jeremy L. Wyatt

claim paper

Read More »

click to vote

ECML
2005
Springer

95views Machine Learning» more ECML 2005»

Towards Finite-Sample Convergence of Direct Reinforcement Learning

15 years 9 months ago

Download www.cs.uiuc.edu

Abstract. While direct, model-free reinforcement learning often performs better than model-based approaches in practice, only the latter have yet supported theoretical guarantees f...

Shiau Hong Lim, Gerald DeJong

claim paper

Read More »

108

click to vote

ICRA
2002
IEEE

87views Robotics» more ICRA 2002»

Visual Guided Grasping of Aggregates using Self-Valuing Learning

15 years 8 months ago

Download www6.in.tum.de

We present a self-valuing learning technique which is capable of learning how to grasp unfamiliar objects and generalize the learned abilities. The learning system consists of two...

Bernd Rössler, Jianwei Zhang, Alois Knoll

claim paper

Read More »

150

click to vote

AGENTS
2001
Springer

201views Security Privacy» more AGENTS 2001»

Using background knowledge to speed reinforcement learning in physical agents

15 years 8 months ago

Download www.isle.org

This paper describes Icarus, an agent architecture that embeds a hierarchical reinforcement learning algorithm within a language for specifying agent behavior. An Icarus program e...

Daniel G. Shapiro, Pat Langley, Ross D. Shachter

claim paper

Read More »

« Prev « First page 45 / 246 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers