Search Sciweavers | Sciweavers

139 search results - page 11 / 28

» Model-based function approximation in reinforcement learning

PKDD
2009
Springer

144 views, Data Mining

Compositional Models for Reinforcement Learning

16 years 1 month ago

Download userweb.cs.utexas.edu

Abstract. Innovations such as optimistic exploration, function approximation, and hierarchical decomposition have helped scale reinforcement learning to more complex environments, ...

Nicholas K. Jong, Peter Stone

AAAI
2008

207 views, Intelligent Agents

Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation

15 years 8 months ago

Download sugiyama-www.cs.titech.ac.jp

Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are us...

Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiya...

IROS
2006
IEEE

190 views, Robotics

Q-RAN: A Constructive Reinforcement Learning Approach for Robot Behavior Learning

16 years 14 days ago

Download www.aass.oru.se

Abstract. This paper presents a learning system that uses Q-learning with a resource allocating network (RAN) for behavior learning in mobile robotics. The RAN is used as a functi...

Jun Li, Achim J. Lilienthal, Tomás Martí...

ICML
2003
IEEE

157 views, Machine Learning

Action Elimination and Stopping Conditions for Reinforcement Learning

16 years 7 months ago

Download www.hpl.hp.com

We consider incorporating action elimination procedures in reinforcement learning algorithms. We suggest a framework that is based on learning upper and lower estimates of th...

Eyal Even-Dar, Shie Mannor, Yishay Mansour

ICML
2010
IEEE

231 views, Machine Learning

Toward Off-Policy Learning Control with Function Approximation

15 years 7 months ago

Download www.sztaki.hu

We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...

Hamid Reza Maei, Csaba Szepesvári, Shalabh ...
