Search Sciweavers | Sciweavers

85 search results - page 1 / 17

» Approximate Policy Iteration with a Policy Language Bias

click to vote

NIPS
2003

196views Information Technology» more NIPS 2003»

Approximate Policy Iteration with a Policy Language Bias

13 years 5 months ago

Download www.jair.org

We study an approach to policy selection for large relational Markov Decision Processes (MDPs). We consider a variant of approximate policy iteration (API) that replaces the usual...

Alan Fern, Sung Wook Yoon, Robert Givan

claim paper

Read More »

click to vote

IJCAI
2003

147views Artificial Intelligence» more IJCAI 2003»

Approximate Policy Iteration using Large-Margin Classifiers

13 years 6 months ago

Download ijcai.org

We present an approximate policy iteration algorithm that uses rollouts to estimate the value of each action under a given policy in a subset of states and a classifier to general...

Michail G. Lagoudakis, Ronald Parr

claim paper

Read More »

click to vote

ICML
2010
IEEE

202views Machine Learning» more ICML 2010»

Least-Squares Policy Iteration: Bias-Variance Trade-off in Control Problems

13 years 5 months ago

Download www.icml2010.org

Christophe Thiery, Bruno Scherrer

claim paper

Read More »

click to vote

NN
2010
Springer

187views Neural Networks» more NN 2010»

Efficient exploration through active learning for value function approximation in reinforcement learning

12 years 11 months ago

Download sugiyama-www.cs.titech.ac.jp

Appropriately designing sampling policies is highly important for obtaining better control policies in reinforcement learning. In this paper, we first show that the least-squares ...

Takayuki Akiyama, Hirotaka Hachiya, Masashi Sugiya...

claim paper

Read More »

click to vote

ICML
2003
IEEE

174views Machine Learning» more ICML 2003»

Error Bounds for Approximate Policy Iteration

14 years 5 months ago

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers