EWRL 2008 | Sciweavers

41

EWRL
2008

186views Machine Learning» more EWRL 2008»

Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case

13 years 11 months ago

We consider reinforcement learning in the parameterized setup, where the model is known to belong to a parameterized family of Markov Decision Processes (MDPs). We further impose ...

Kirill Dyagilev, Shie Mannor, Nahum Shimkin

claim paper

Read More »

25

click to vote

EWRL
2008

121views Machine Learning» more EWRL 2008»

Variable Metric Reinforcement Learning Methods Applied to the Noisy Mountain Car Problem

13 years 11 months ago

Download www.neuroinformatik.ruhr-uni-bochum.de

Two variable metric reinforcement learning methods, the natural actor-critic algorithm and the covariance matrix adaptation evolution strategy, are compared on a conceptual level a...

Verena Heidrich-Meisner, Christian Igel

claim paper

Read More »

26

click to vote

EWRL
2008

143views Machine Learning» more EWRL 2008»

New Error Bounds for Approximations from Projected Linear Equations

13 years 11 months ago

Download www.mit.edu

We consider linear fixed point equations and their approximations by projection on a low dimensional subspace. We derive new bounds on the approximation error of the solution, whi...

Huizhen Yu, Dimitri P. Bertsekas

claim paper

Read More »

17

click to vote

EWRL
2008

121views Machine Learning» more EWRL 2008»

Probabilistic Inference for Fast Learning in Control

13 years 11 months ago

Download mlg.eng.cam.ac.uk

Carl Edward Rasmussen, Marc Peter Deisenroth

claim paper

Read More »

26

click to vote

EWRL
2008

129views Machine Learning» more EWRL 2008»

Markov Decision Processes with Arbitrary Reward Processes

13 years 11 months ago

Download www.cim.mcgill.ca

Abstract. We consider a control problem where the decision maker interacts with a standard Markov decision process with the exception that the reward functions vary arbitrarily ove...

Jia Yuan Yu, Shie Mannor, Nahum Shimkin

claim paper

Read More »

19

click to vote

EWRL
2008

158views Machine Learning» more EWRL 2008»

Basis Expansion in Natural Actor Critic Methods

13 years 11 months ago

Download www.ceng.metu.edu.tr

Sertan Girgin, Philippe Preux

claim paper

Read More »

22

click to vote

EWRL
2008

104views Machine Learning» more EWRL 2008»

Optimistic Planning of Deterministic Systems

13 years 11 months ago

Download eprints.pascal-network.org

If one possesses a model of a controlled deterministic system, then from any state, one may consider the set of all possible reachable states starting from that state and using any...

Jean-François Hren, Rémi Munos

claim paper

Read More »

26

click to vote

EWRL
2008

144views Machine Learning» more EWRL 2008»

Regularized Fitted Q-Iteration: Application to Planning

13 years 11 months ago

Download eprints.pascal-network.org

We consider planning in a Markovian decision problem, i.e., the problem of finding a good policy given access to a generative model of the environment. We propose to use fitted Q-i...

Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...

claim paper

Read More »

20

click to vote

EWRL
2008

133views Machine Learning» more EWRL 2008»

Exploiting Additive Structure in Factored MDPs for Reinforcement Learning

13 years 11 months ago

Download ewrl08.futurs.inria.fr

Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille...

claim paper

Read More »

32

click to vote

EWRL
2008

148views Machine Learning» more EWRL 2008»

Policy Learning - A Unified Perspective with Applications in Robotics

13 years 11 months ago

Download www.kyb.tuebingen.mpg.de

Policy Learning approaches are among the best suited methods for high-dimensional, continuous control systems such as anthropomorphic robot arms and humanoid robots. In this paper,...

Jan Peters, Jens Kober, Duy Nguyen-Tuong

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers