Search Sciweavers | Sciweavers

102 search results - page 5 / 21

» Efficient Asymptotic Approximation in Temporal Difference Le...

NIPS
2001

Model-Free Least-Squares Policy Iteration

15 years 1 month ago

Download www.cs.duke.edu

We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...

Michail G. Lagoudakis, Ronald Parr

NIPS
2008

Regularized Policy Iteration

15 years 1 month ago

Download webdocs.cs.ualberta.ca

In this paper we consider approximate policy-iteration-based reinforcement learning algorithms. In order to implement a flexible function approximation scheme we propose the use o...

Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...

ATAL
2005
Springer

Improving reinforcement learning function approximators via neuroevolution

15 years 5 months ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson

JAIR
2006

Learning Sentence-internal Temporal Relations

14 years 11 months ago

Download www.jair.org

In this paper we propose a data intensive approach for inferring sentence-internal temporal relations. Temporal inference is relevant for practical NLP applications which either e...

Maria Lapata, Alex Lascarides

UAI
1996

Efficient Approximations for the Marginal Likelihood of Incomplete Data Given a Bayesian Network

15 years 28 days ago

Download research.microsoft.com

We discuss Bayesian methods for learning Bayesian networks when data sets are incomplete. In particular, we examine asymptotic approximations for the marginal likelihood of incomp...

David Maxwell Chickering, David Heckerman
