Search Sciweavers | Sciweavers

35 search results - page 3 / 7

» Very fast action selection for parameterized behaviors

click to vote

ICRA
2010
IEEE

117views Robotics» more ICRA 2010»

Learning reliable and efficient navigation with a humanoid

13 years 4 months ago

Download hrl.informatik.uni-freiburg.de

Reliable and efficient navigation with a humanoid robot is a difficult task. First, the motion commands are executed rather inaccurately due to backlash in the joints or foot slipp...

Stefan Oßwald, Armin Hornung, Maren Bennewit...

claim paper

Read More »

click to vote

ICML
2005
IEEE

104views Machine Learning» more ICML 2005»

Fast condensed nearest neighbor rule

14 years 7 months ago

Download www.machinelearning.org

We present a novel algorithm for computing a training set consistent subset for the nearest neighbor decision rule. The algorithm, called FCNN rule, has some desirable properties....

Fabrizio Angiulli

claim paper

Read More »

click to vote

ICML
2003
IEEE

121views Machine Learning» more ICML 2003»

Q-Decomposition for Reinforcement Learning Agents

14 years 7 months ago

Download www.hpl.hp.com

The paper explores a very simple agent design method called Q-decomposition, wherein a complex agent is built from simpler subagents. Each subagent has its own reward function and...

Stuart J. Russell, Andrew Zimdars

claim paper

Read More »

click to vote

PERCOM
2004
ACM

88views Computer Networks» more PERCOM 2004»

Employing User Feedback for Fast, Accurate, Low-Maintenance Geolocationing

14 years 5 months ago

Download cseweb.ucsd.edu

One way to improve inferences on sensor data is to tune the algorithms through a time-consuming offline procedure. A less expensive, and potentially more accurate method is to use...

Ezekiel S. Bhasker, Steven W. Brown, William G. Gr...

claim paper

Read More »

click to vote

ATAL
2005
Springer

181views Intelligent Agents» more ATAL 2005»

Improving reinforcement learning function approximators via neuroevolution

13 years 11 months ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson

claim paper

Read More »

« Prev « First page 3 / 7 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers