Search Sciweavers | Sciweavers

168 search results - page 4 / 34

» Reinforcement Learning Algorithm for Partially Observable Ma...

click to vote

ECAI
2008
Springer

158views Artificial Intelligence» more ECAI 2008»

A Simulation-based Approach for Solving Generalized Semi-Markov Decision Processes

13 years 8 months ago

Download emmanuel.rachelson.free.fr

Time is a crucial variable in planning and often requires special attention since it introduces a specific structure along with additional complexity, especially in the case of dec...

Emmanuel Rachelson, Gauthier Quesnel, Fréd&...

claim paper

Read More »

click to vote

ECML
2003
Springer

149views Machine Learning» more ECML 2003»

Could Active Perception Aid Navigation of Partially Observable Grid Worlds?

13 years 11 months ago

Download homepages.inf.ed.ac.uk

Due to the unavoidable fact that a robot’s sensors will be limited in some manner, it is entirely possible that it can ﬁnd itself unable to distinguish between diﬀering state...

Paul A. Crook, Gillian Hayes

claim paper

Read More »

click to vote

ICRA
2007
IEEE

154views Robotics» more ICRA 2007»

Oracular Partially Observable Markov Decision Processes: A Very Special Case

14 years 17 days ago

Download www.cs.cmu.edu

— We introduce the Oracular Partially Observable Markov Decision Process (OPOMDP), a type of POMDP in which the world produces no observations; instead there is an “oracle,” ...

Nicholas Armstrong-Crews, Manuela M. Veloso

claim paper

Read More »

click to vote

CORR
2010
Springer

105views Education» more CORR 2010»

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

13 years 4 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in ﬁnite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

claim paper

Read More »

click to vote

AAAI
2011

246views Intelligent Agents» more AAAI 2011»

An Online Spectral Learning Algorithm for Partially Observable Nonlinear Dynamical Systems

12 years 6 months ago

Download www.cs.cmu.edu

Recently, a number of researchers have proposed spectral algorithms for learning models of dynamical systems—for example, Hidden Markov Models (HMMs), Partially Observable Marko...

Byron Boots, Geoffrey J. Gordon

claim paper

Read More »

« Prev « First page 4 / 34 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers