Sciweavers

334 search results - page 54 / 67
Search: How to Dynamically Merge Markov Decision Processes
JMLR 2006
Causal Graph Based Decomposition of Factored MDPs
We present Variable Influence Structure Analysis, or VISA, an algorithm that performs hierarchical decomposition of factored Markov decision processes. VISA uses a dynamic Bayesia...
Anders Jonsson, Andrew G. Barto
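The snippet names the technique but not its mechanics. As a rough illustration, not the authors' implementation, the sketch below builds a causal graph from a hypothetical set of DBN variable dependencies and groups the state variables into strongly connected components, the kind of structure a causal-graph-based decomposition can exploit; the variable names and the use of networkx are assumptions made here.

```python
# Illustrative sketch only: build a causal graph over the state variables of a
# factored MDP from hypothetical DBN dependencies, then group the variables into
# strongly connected components and order them -- the kind of structure a
# causal-graph-based decomposition such as VISA can exploit.
import networkx as nx

# Hypothetical factored MDP: variable -> variables whose next-step value it influences.
dbn_influences = {
    "key": ["door"],        # holding the key influences whether the door can open
    "door": ["location"],   # the door's state influences reachable locations
    "location": ["goal"],   # the agent's location influences goal attainment
    "goal": [],
}

causal_graph = nx.DiGraph()
for var, influenced in dbn_influences.items():
    causal_graph.add_node(var)
    for target in influenced:
        causal_graph.add_edge(var, target)

# Strongly connected components become candidate "levels" of a hierarchy; a
# topological order of the condensed graph suggests the order in which subtasks
# for controlling each component could be introduced.
condensed = nx.condensation(causal_graph)
for scc_id in nx.topological_sort(condensed):
    print(sorted(condensed.nodes[scc_id]["members"]))
```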
PKDD 2010, Springer
Smarter Sampling in Model-Based Bayesian Reinforcement Learning
Bayesian reinforcement learning (RL) aims to make more efficient use of data samples, but typically uses significantly more computation. For discrete Markov Decis...
Pablo Samuel Castro, Doina Precup
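As context for the snippet, here is a minimal sketch of the model-based Bayesian RL loop it refers to: a Dirichlet posterior over transition probabilities, one model sampled from the posterior, and planning in that sample. This is a generic baseline, not the paper's sampling scheme; the state and action sizes, the reward table, and the helper names are assumptions.

```python
# Minimal model-based Bayesian RL sketch for a discrete MDP (not the paper's
# algorithm): maintain Dirichlet posteriors over transitions, sample one model,
# and plan in that sample.
import numpy as np

n_states, n_actions, gamma = 5, 2, 0.95
rng = np.random.default_rng(0)

# Dirichlet counts for P(s' | s, a); a uniform prior of one pseudo-count each.
counts = np.ones((n_states, n_actions, n_states))
rewards = rng.random((n_states, n_actions))        # assume a known reward table

def update(s, a, s_next):
    """Bayesian update: observing (s, a, s') just increments a count."""
    counts[s, a, s_next] += 1.0

def sample_model():
    """Draw one full transition model from the Dirichlet posterior."""
    return np.array([[rng.dirichlet(counts[s, a]) for a in range(n_actions)]
                     for s in range(n_states)])

def plan(P, iters=200):
    """Value iteration in the sampled model; returns the greedy policy."""
    V = np.zeros(n_states)
    for _ in range(iters):
        Q = rewards + gamma * (P @ V)              # Q[s, a]
        V = Q.max(axis=1)
    return Q.argmax(axis=1)

# Pretend some experience has been gathered, then plan in one posterior sample.
for _ in range(100):
    s, a = rng.integers(n_states), rng.integers(n_actions)
    update(s, a, rng.integers(n_states))
print("greedy policy in sampled model:", plan(sample_model()))
```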
GLOBECOM 2010, IEEE
Need-Based Communication for Smart Grid: When to Inquire Power Price?
In a smart grid, a home appliance can adjust its power consumption level according to the real-time power price obtained over communication channels. Most studies on the smart grid do not...
Husheng Li, Robert C. Qiu
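The snippet does not describe the decision model, so the toy example below only illustrates the underlying trade-off, not the authors' formulation: paying a small inquiry cost reveals the current price and lets the appliance defer its load when the price is high. All prices, probabilities, and costs are made-up numbers.

```python
# Toy "when to inquire" trade-off (not the paper's model): inquire only if the
# expected saving from knowing the current price exceeds the inquiry cost.
import numpy as np

prices = np.array([0.10, 0.20, 0.40])      # possible $/kWh price levels (hypothetical)
probs = np.array([0.5, 0.3, 0.2])          # appliance's prior belief over the levels
load_kwh = 2.0                             # energy the appliance wants to consume now
deferred_price = 0.12                      # $/kWh it expects to pay if it defers the load
inquiry_cost = 0.05                        # communication overhead of one inquiry ($)

# Without inquiring: run now at whatever the (unknown) price happens to be.
cost_blind = load_kwh * float(probs @ prices)

# With inquiring: learn the price, then run now or defer, whichever is cheaper.
cost_informed = load_kwh * float(probs @ np.minimum(prices, deferred_price))

print(f"blind: {cost_blind:.3f}, informed: {cost_informed:.3f}")
print("inquire" if cost_blind - cost_informed > inquiry_cost else "do not inquire")
```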
GECCO 2009, Springer
Uncertainty handling CMA-ES for reinforcement learning
The covariance matrix adaptation evolution strategy (CMA-ES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an ada...
Verena Heidrich-Meisner, Christian Igel
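A minimal sketch of direct policy search with CMA-ES on a noisy return, assuming the pycma package (`cma`) as the optimizer. The uncertainty handling shown, averaging several rollouts per candidate, is a deliberately simple stand-in for the adaptive scheme the snippet refers to; the objective and the parameter dimension are invented.

```python
# Direct policy search with CMA-ES on a noisy RL objective. Re-evaluating each
# candidate several times keeps rankings from being dominated by rollout noise.
import numpy as np
import cma

rng = np.random.default_rng(0)

def episode_return(policy_params):
    """Hypothetical noisy rollout: negative squared distance to an unknown optimum."""
    return -np.sum((policy_params - 1.0) ** 2) + rng.normal(scale=0.5)

def noisy_fitness(policy_params, n_rollouts=5):
    """Average several rollouts; negate because CMA-ES minimizes."""
    return -np.mean([episode_return(policy_params) for _ in range(n_rollouts)])

es = cma.CMAEvolutionStrategy(x0=np.zeros(4), sigma0=0.5,
                              inopts={"verbose": -9, "maxiter": 200})
while not es.stop():
    candidates = es.ask()                      # sample candidate policy parameters
    es.tell(candidates, [noisy_fitness(c) for c in candidates])
print("best policy parameters:", es.result.xbest)
```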
ICRA 2008, IEEE
A point-based POMDP planner for target tracking
Target tracking has two variants that are often studied independently with different approaches: target searching requires a robot to find a target that is initially not visible, and ...
David Hsu, Wee Sun Lee, Nan Rong
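The planner in this paper reasons over beliefs about the target's state. The sketch below shows only that ingredient, a Bayes-filter belief update over target positions in a 1-D corridor, with motion and sensor models invented for illustration; it is not the point-based value iteration the paper contributes.

```python
# Belief tracking over target positions in a 1-D corridor: the state estimate a
# POMDP policy for target tracking would act on. Models are hypothetical.
import numpy as np

n_cells = 10
belief = np.full(n_cells, 1.0 / n_cells)      # uniform prior over target positions

def predict(belief, p_stay=0.6):
    """Hypothetical target motion: stay, or move one cell left or right."""
    p_move = (1.0 - p_stay) / 2.0
    new_belief = p_stay * belief
    new_belief[1:] += p_move * belief[:-1]    # target moved right
    new_belief[:-1] += p_move * belief[1:]    # target moved left
    return new_belief / new_belief.sum()

def update(belief, robot_cell, detected, p_detect=0.9, p_false=0.05):
    """Hypothetical sensor: reliable detection only in the robot's own cell."""
    likelihood = np.full(n_cells, p_false if detected else 1.0 - p_false)
    likelihood[robot_cell] = p_detect if detected else 1.0 - p_detect
    posterior = likelihood * belief
    return posterior / posterior.sum()

# One tracking step: the robot looks at cell 3 and sees nothing, so
# probability mass shifts away from that cell.
belief = update(predict(belief), robot_cell=3, detected=False)
print(np.round(belief, 3))
```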