Sciweavers

252

Publication

222views

Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration

16 years 4 months ago

Abstract: Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers