Sciweavers

1233 search results - page 177 / 247
» Reinforcement learning
AROBOTS
2008
15 years 3 months ago
Active audition using the parameter-less self-organising map
This paper presents a novel method for enabling a robot to determine the position of a sound source in three dimensions using just two microphones and interaction with its environm...
Erik Berglund, Joaquin Sitte, Gordon Wyeth
AGI
2011
14 years 7 months ago
Measuring Agent Intelligence via Hierarchies of Environments
Under Legg’s and Hutter’s formal measure [1], performance in easy environments counts more toward an agent’s intelligence than does performance in difficult environments. An ...
Bill Hibbard
CORR
2006
Springer
15 years 4 months ago
Nearly optimal exploration-exploitation decision thresholds
While in general trading off exploration and exploitation in reinforcement learning is hard, under some formulations relatively simple solutions exist. Optimal decision thresholds ...
Christos Dimitrakakis
ECAI
2006
Springer
15 years 7 months ago
Using Emotions for Behaviour-Selection Learning
Emotions play a very important role in human behaviour and social interaction. In this paper we present a control architecture which uses emotions in the behaviour selection proces...
Maria Malfaz, Miguel Angel Salichs
ICML
2009
IEEE
16 years 4 months ago
Constraint relaxation in approximate linear programs
Approximate Linear Programming (ALP) is a reinforcement learning technique with nice theoretical properties, but it often performs poorly in practice. We identify some reasons for...
Marek Petrik, Shlomo Zilberstein