Search Sciweavers | Sciweavers

178 search results - page 26 / 36

» Efficient Approximation of Optimal Control for Markov Games


AMAI
2004
Springer

164 views | Artificial Intelligence

A Framework for Sequential Planning in Multi-Agent Settings

15 years 5 months ago

Download www.jair.org

This paper extends the framework of partially observable Markov decision processes (POMDPs) to multi-agent settings by incorporating the notion of agent models into the state spac...

Piotr J. Gmytrasiewicz, Prashant Doshi


CVPR
2007
IEEE

151 views | Computer Vision

Efficient MRF Deformation Model for Non-Rigid Image Matching

16 years 1 month ago

Download cmp.felk.cvut.cz

We propose a novel MRF-based model for deformable image matching (also known as registration). The deformation is described by a field of discrete variables, representing displace...

Alexander Shekhovtsov, Ivan Kovtun, Václav ...


JMLR
2010

148 views

A Generalized Path Integral Control Approach to Reinforcement Learning

14 years 6 months ago

Download jmlr.csail.mit.edu

With the goal to generate more scalable algorithms with higher efficiency and fewer open parameters, reinforcement learning (RL) has recently moved towards combining classical tec...

Evangelos Theodorou, Jonas Buchli, Stefan Schaal


ICML
2000
IEEE

126 views | Machine Learning

Reinforcement Learning in POMDP's via Direct Gradient Ascent

16 years 13 days ago

Download reference.kfupm.edu.sa

This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled POMDPs. We introduce GPOMDP, a...

Jonathan Baxter, Peter L. Bartlett


NIPS
2001

144 views | Information Technology

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 1 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...
