Search Sciweavers | Sciweavers

95

ICALP
2009
Springer

92views Programming Languages» more ICALP 2009»

16 years 2 months ago

We define stochastic timed games, which extend two-player timed games with probabilities (following a recent approach by Baier et al), and which extend in a natural way continuous-...

Patricia Bouyer, Vojtech Forejt

claim paper

Read More »

97

Voted

ICRA
2007
IEEE

126views Robotics» more ICRA 2007»

A formal framework for robot learning and control under model uncertainty

15 years 8 months ago

Download www.cs.mcgill.ca

— While the Partially Observable Markov Decision Process (POMDP) provides a formal framework for the problem of robot control under uncertainty, it typically assumes a known and ...

Robin Jaulmes, Joelle Pineau, Doina Precup

claim paper

Read More »

81

click to vote

ECML
2007
Springer

108views Machine Learning» more ECML 2007»

Safe Q-Learning on Complete History Spaces

15 years 8 months ago

Download www.ni.uos.de

In this article, we present an idea for solving deterministic partially observable markov decision processes (POMDPs) based on a history space containing sequences of past observat...

Stephan Timmer, Martin Riedmiller

claim paper

Read More »

83

click to vote

ICANN
2007
Springer

95views Neural Networks» more ICANN 2007»

Solving Deep Memory POMDPs with Recurrent Policy Gradients

15 years 8 months ago

Download www.idsia.ch

Abstract. This paper presents Recurrent Policy Gradients, a modelfree reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...

Daan Wierstra, Alexander Förster, Jan Peters,...

claim paper

Read More »

148

Voted

ICML
2006
IEEE

256views Machine Learning» more ICML 2006»

Automatic basis function construction for approximate dynamic programming and reinforcement learning

15 years 7 months ago

Download www.ece.mcgill.ca

We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...

Philipp W. Keller, Shie Mannor, Doina Precup

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers