reinforcement learning

Publication

352 views

Efficient methods for near-optimal sequential decision making under uncertainty

14 years 8 days ago

This chapter discusses decision making under uncertainty. More specifically, it offers an overview of efficient Bayesian and distribution-free algorithms for making near-optimal se...

Christos Dimitrakakis

posted by olethros

Publication

222 views

Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration

14 years 1 month ago

Download arxiv.org

Abstract: Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros

Publication

334 views

Rollout Sampling Approximate Policy Iteration

14 years 1 month ago

Download www.springerlink.com

Several researchers have recently investigated the connection between reinforcement learning and classification. We are motivated by proposals of approximate policy iteration schem...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros

27 posts with 8473 views, 1430 profile views

olethros, Postdoctoral

EPFL

Homepage lia.epfl.ch

Bayesian Reinforcement Learning | Complexity Analysis | Decision Theory | Intrusion Detection | Learning In Games | Machine Learning | Partially Observable Stochastic Games | POMDPs | Regret Bounds | Reinforcement Learning | Stochastic Optimization |

posted by olethros Mar 14 2010

ICAART 2010, INSTICC

509 views, Intelligent Agents

Complexity of Stochastic Branch and Bound Methods for Belief Tree Search in Bayesian Reinforcement Learning

14 years 1 month ago

Download arxiv.org

There has been a lot of recent work on Bayesian methods for reinforcement learning exhibiting near-optimal online performance. The main obstacle facing such methods is that in most...

Christos Dimitrakakis

posted by olethros

2 posts with 280 views, 299 profile views

Daniel L. Elliott, Student, PhD

Colorado State University

I am a Ph.D. student working with Chuck Anderson studying reinforcement learning. I also enjoy high dimensional data issues, mixture models, neural networks, and simple, yet effec...

Curriculum Vitae (CV)

Computer Vision | Reinforcement Learning |

posted by danelliottster Nov 23 2009

ICML 1999, IEEE

138 views, Machine Learning

Using Reinforcement Learning to Spider the Web Efficiently

14 years 5 months ago

Download www.cs.iastate.edu

Consider the task of exploring the Web in order to find pages of a particular kind or on a particular topic. This task arises in the construction of search engines and Web knowled...

Jason Rennie, Andrew McCallum

ICML 2000, IEEE

155 views, Machine Learning

Combining Reinforcement Learning with a Local Control Algorithm

14 years 5 months ago

Download www-anw.cs.umass.edu

We explore combining reinforcement learning with a hand-crafted local controller in a manner suggested by the chaotic control algorithm of Vincent, Schmitt and Vincent (1994). A c...

Andrew G. Barto, Jette Randløv, Michael T. ...

ICML 2004, IEEE

161 views, Machine Learning

Using relative novelty to identify useful temporal abstractions in reinforcement learning

14 years 5 months ago

Download www.cs.umass.edu

Using Relative Novelty to Identify Useful Temporal Abstractions in Reinforcement Learning. Özgür Şimşek (ozgur@cs.umass.edu), Andrew G. Barto (barto@cs.umass.edu), Department of Computer Scie...

Özgür Simsek, Andrew G. Barto

ICML 2004, IEEE

156 views, Machine Learning

Learning to fly by combining reinforcement learning with behavioural cloning

14 years 5 months ago

Download ccc.inaoep.mx

Reinforcement learning deals with learning optimal or near optimal policies while interacting with the environment. Application domains with many continuous variables are difficul...

Eduardo F. Morales, Claude Sammut

Sciweavers
