Search Sciweavers | Sciweavers

141 search results - page 28 / 29

» CBR for State Value Function Approximation in Reinforcement ...

click to vote

ICML
2010
IEEE

247views Machine Learning» more ICML 2010»

Inverse Optimal Control with Linearly-Solvable MDPs

13 years 6 months ago

Download www.cs.washington.edu

We present new algorithms for inverse optimal control (or inverse reinforcement learning, IRL) within the framework of linearlysolvable MDPs (LMDPs). Unlike most prior IRL algorit...

Dvijotham Krishnamurthy, Emanuel Todorov

claim paper

Read More »

click to vote

CIMCA
2008
IEEE

125views Intelligent Agents» more CIMCA 2008»

Tree Exploration for Bayesian RL Exploration

14 years 6 days ago

Download arxiv.org

Research in reinforcement learning has produced algorithms for optimal decision making under uncertainty that fall within two main types. The ﬁrst employs a Bayesian framework, ...

Christos Dimitrakakis

posted by olethros

Read More »

click to vote

CORR
2008
Springer

98views Education» more CORR 2008»

Information Acquisition and Exploitation in Multichannel Wireless Networks

13 years 5 months ago

Download www.cis.upenn.edu

A wireless system with multiple channels is considered, where each channel has several transmission states. A user learns about the instantaneous state of an available channel by ...

Sudipto Guha, Kamesh Munagala, Saswati Sarkar

claim paper

Read More »

click to vote

RSS
2007

176views Robotics» more RSS 2007»

Active Policy Learning for Robot Planning and Exploration under Uncertainty

13 years 7 months ago

Download www.roboticsproceedings.org

Abstract— This paper proposes a simulation-based active policy learning algorithm for ﬁnite-horizon, partially-observed sequential decision processes. The algorithm is tested i...

Ruben Martinez-Cantin, Nando de Freitas, Arnaud Do...

claim paper

Read More »

click to vote

JMLR
2010

137views more JMLR 2010»

Importance Sampling for Continuous Time Bayesian Networks

13 years 16 days ago

Download jmlr.csail.mit.edu

A continuous time Bayesian network (CTBN) uses a structured representation to describe a dynamic system with a finite number of states which evolves in continuous time. Exact infe...

Yu Fan, Jing Xu, Christian R. Shelton

claim paper

Read More »

« Prev « First page 28 / 29 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers