Search Sciweavers | Sciweavers


» Reinforcement learning


ICML
2009
IEEE

160 views, Machine Learning

The adaptive k-meteorologists problem and its application to structure learning and feature selection in reinforcement learning

16 years 6 months ago

Download www.research.rutgers.edu

The purpose of this paper is three-fold. First, we formalize and study a problem of learning probabilistic concepts in the recently proposed KWIK framework. We give details of an ...

Carlos Diuk, Lihong Li, Bethany R. Leffler


COLT
2000
Springer

87 views, Machine Learning

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 10 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process (POMDP), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter


IJAIT
2008

146 views

Learning to Behave in Space: a Qualitative Spatial Representation for Robot Navigation with Reinforcement Learning

15 years 6 months ago

Download www.aussagekraft.de

ion mechanism to create a representation of space consisting of the circular order of detected landmarks and the relative position of walls towards the agent's moving directio...

Lutz Frommberger


ATAL
2009
Springer

77 views, Intelligent Agents

Learning with whom to communicate using relational reinforcement learning

16 years 17 days ago

Download www.aamas-conference.org

Marc J. V. Ponsen, Tom Croonenborghs, Karl Tuyls, ...
