Sciweavers

99 search results - page 18 / 20
» Action Selection in Bayesian Reinforcement Learning
Sort
View
ICML
2010
IEEE
13 years 7 months ago
Toward Off-Policy Learning Control with Function Approximation
We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...
Hamid Reza Maei, Csaba Szepesvári, Shalabh ...
EWCBR
2008
Springer
13 years 7 months ago
Discovering Feature Weights for Feature-based Indexing of Q-tables
In this paper we propose an approach to address the old problem of identifying the feature conditions under which a gaming strategy can be effective. For doing this, we will build ...
Chad Hogg, Stephen Lee-Urban, Bryan Auslander, H&e...
LWA
2007
13 years 7 months ago
Towards Learning User-Adaptive State Models in a Conversational Recommender System
Typical conversational recommender systems support interactive strategies that are hard-coded in advance and followed rigidly during a recommendation session. In fact, Reinforceme...
Tariq Mahmood, Francesco Ricci
AR
1998
106views more  AR 1998»
13 years 5 months ago
A cognitive robot architecture based on tactile and visual information
In this paper, we propose an architecture for a cognitive robot based on tactile and visual information. Visual information contains various features such as location and area of ...
Kazunori Terada, Takayuki Nakamura, Hideaki Takeda...
DALT
2005
Springer
13 years 11 months ago
An Architecture for Rational Agents
Abstract. This paper is concerned with designing architectures for rational agents. In the proposed architecture, agents have belief bases that are theories in a multi-modal, highe...
John W. Lloyd, Tim D. Sears