Search Sciweavers | Sciweavers

1233 search results - page 177 / 247

» Reinforcement learning

174

click to vote

AROBOTS
2008

131views more AROBOTS 2008»

Active audition using the parameter-less self-organising map

15 years 5 months ago

Download nicta.com.au

This paper presents a novel method for enabling a robot to determine the position of a sound source in three dimensions using just two microphones and interaction with its environm...

Erik Berglund, Joaquin Sitte, Gordon Wyeth

claim paper

Read More »

179

click to vote

AGI
2011

222views Artificial Intelligence» more AGI 2011»

Measuring Agent Intelligence via Hierarchies of Environments

14 years 9 months ago

Download www.ssec.wisc.edu

Under Legg’s and Hutter’s formal measure [1], performance in easy environments counts more toward an agent’s intelligence than does performance in difficult environments. An ...

Bill Hibbard

claim paper

Read More »

210

click to vote

CORR
2006
Springer

140views Education» more CORR 2006»

Nearly optimal exploration-exploitation decision thresholds

15 years 6 months ago

Download www.idiap.ch

While in general trading off exploration and exploitation in reinforcement learning is hard, under some formulations relatively simple solutions exist. Optimal decision thresholds ...

Christos Dimitrakakis

posted by olethros

Read More »

167

click to vote

ECAI
2006
Springer

127views Artificial Intelligence» more ECAI 2006»

Using Emotions for Behaviour-Selection Learning

15 years 9 months ago

Download roboticslab.uc3m.es

Emotions play a very important role in human behaviour and social interaction. In this paper we present a control architecture which uses emotions in the behaviour selection proces...

Maria Malfaz, Miguel Angel Salichs

claim paper

Read More »

136

click to vote

ICML
2009
IEEE

123views Machine Learning» more ICML 2009»

Constraint relaxation in approximate linear programs

16 years 6 months ago

Download anytime.cs.umass.edu

Approximate Linear Programming (ALP) is a reinforcement learning technique with nice theoretical properties, but it often performs poorly in practice. We identify some reasons for...

Marek Petrik, Shlomo Zilberstein

claim paper

Read More »

« Prev « First page 177 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers