Sciweavers

1235 search results - page 114 / 247

» Reinforcement learning in a nutshell

103

ICAART
2010
INSTICC

136views Intelligent Agents» more ICAART 2010»

A Reinforcement Learning Approach for Multiagent Navigation

16 years 2 months ago

A Reinforcement Learning Approach for Multiagent Navigation

Download scalab.uc3m.es

Francisco Martinez-Gil, Fernando Barber, Miguel Lo...

claim paper

Read More »

149

ICAART
2010
INSTICC

222views Intelligent Agents» more ICAART 2010»

Exploiting Similarity Information in Reinforcement Learning - Similarity Models for Multi-Armed Bandits and MDPs

16 years 2 months ago

Exploiting Similarity Information in Reinforcement Learning - Similarity Models for Multi-Armed Bandits and MDPs

Download personal.unileoben.ac.at

Ronald Ortner

claim paper

Read More »

112

Voted

ICAART
2010
INSTICC

288views Intelligent Agents» more ICAART 2010»

A Cautious Approach to Generalization in Reinforcement Learning

16 years 2 months ago

A Cautious Approach to Generalization in Reinforcement Learning

Download www.montefiore.ulg.ac.be

Raphael Fonteneau, Susan A. Murphy, Louis Wehenkel...

claim paper

Read More »

148

IUI
2009
ACM

110views Software Engineering» more IUI 2009»

A bayesian reinforcement learning approach for customizing human-robot interfaces

16 years 9 days ago

A bayesian reinforcement learning approach for customizing human-robot interfaces

Download www.cs.mcgill.ca

Amin Atrash, Joelle Pineau

claim paper

Read More »

160

ISDA
2009
IEEE

144views Operating System» more ISDA 2009»

Postponed Updates for Temporal-Difference Reinforcement Learning

16 years 4 days ago

Postponed Updates for Temporal-Difference Reinforcement Learning

Download www.science.uva.nl

This paper presents postponed updates, a new strategy for TD methods that can improve sample efﬁciency without incurring the computational and space requirements of model-based ...

Harm van Seijen, Shimon Whiteson

claim paper

Read More »

« Prev « First page 114 / 247 Last » Next »