Search Sciweavers | Sciweavers

11 search results - page 3 / 3

» Learning in Reactive Environments with Arbitrary Dependence

click to vote

JMLR
2008

129views more JMLR 2008»

Finite-Time Bounds for Fitted Value Iteration

13 years 5 months ago

Download www.sztaki.hu

In this paper we develop a theoretical analysis of the performance of sampling-based fitted value iteration (FVI) to solve infinite state-space, discounted-reward Markovian decisi...

Rémi Munos, Csaba Szepesvári

claim paper

Read More »

« Prev « First page 3 / 3 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers