Sciweavers

1233 search results - page 45 / 247
» Reinforcement learning
ICML
2009
IEEE
16 years 2 months ago
The adaptive k-meteorologists problem and its application to structure learning and feature selection in reinforcement learning
The purpose of this paper is three-fold. First, we formalize and study a problem of learning probabilistic concepts in the recently proposed KWIK framework. We give details of an ...
Carlos Diuk, Lihong Li, Bethany R. Leffler
COLT
2000
Springer
15 years 6 months ago
Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning
We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process (POMDP), and focus on gradient ascent approache...
Peter L. Bartlett, Jonathan Baxter
IJAIT
2008
15 years 1 months ago
Learning to Behave in Space: a Qualitative Spatial Representation for Robot Navigation with Reinforcement Learning
ion mechanism to create a representation of space consisting of the circular order of detected landmarks and the relative position of walls towards the agent's moving directio...
Lutz Frommberger
ATAL
2009
Springer
15 years 8 months ago
Learning with whom to communicate using relational reinforcement learning
Marc J. V. Ponsen, Tom Croonenborghs, Karl Tuyls, ...