Search Sciweavers | Sciweavers

5 search results - page 1 / 1

ICML
1997
IEEE

Expected Mistake Bound Model for On-Line Reinforcement Learning

16 years 8 months ago

Download www.cs.ualberta.ca

Claude-Nicolas Fiechter

EWRL
2008

Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case

15 years 9 months ago

Download webee.technion.ac.il

We consider reinforcement learning in the parameterized setup, where the model is known to belong to a parameterized family of Markov Decision Processes (MDPs). We further impose ...

Kirill Dyagilev, Shie Mannor, Nahum Shimkin

CORR
2000
Springer

Predicting the expected behavior of agents that learn about agents: the CLRI framework

15 years 7 months ago

Download jmvidal.cse.sc.edu

We describe a framework and equations used to model and predict the behavior of multi-agent systems (MASs) with learning agents. A difference equation is used for calculating the ...

José M. Vidal, Edmund H. Durfee

ML
2008
ACM

Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path

15 years 7 months ago

Download hal.inria.fr

We consider batch reinforcement learning problems in continuous space, expected total discounted-reward Markovian Decision Problems. As opposed to previous theoretical wo...

András Antos, Csaba Szepesvári, Ré...

ICML
2009
IEEE

Near-Bayesian exploration in polynomial time

16 years 8 months ago

Download ai.stanford.edu

We consider the exploration/exploitation problem in reinforcement learning (RL). The Bayesian approach to model-based RL offers an elegant solution to this problem, by considering...

J. Zico Kolter, Andrew Y. Ng
