Search Sciweavers | Sciweavers


» Markov Decision Processes with Arbitrary Reward Processes

ICML
2004
IEEE

120 views | Machine Learning

Utile distinction hidden Markov models

16 years 2 months ago

Download www.idsia.ch

This paper addresses the problem of constructing good action selection policies for agents acting in partially observable environments, a class of problems generally known as Part...

Daan Wierstra, Marco Wiering

ICML
2006
IEEE

142 views | Machine Learning

An intrinsic reward mechanism for efficient exploration

16 years 2 months ago

Download www-anw.cs.umass.edu

How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...

Özgür Simsek, Andrew G. Barto

IROS
2009
IEEE

206 views | Robotics

Bayesian reinforcement learning in continuous POMDPs with Gaussian processes

15 years 8 months ago

Download www.cs.cmu.edu

Partially Observable Markov Decision Processes (POMDPs) provide a rich mathematical model to handle real-world sequential decision processes but require a known model to be solv...

Patrick Dallaire, Camille Besse, Stéphane R...

ML
2002
ACM

143 views | Machine Learning

A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision Processes

15 years 1 months ago

Download www.cis.upenn.edu

An issue that is critical for the application of Markov decision processes (MDPs) to realistic problems is how the complexity of planning scales with the size of the MDP. In stochas...

Michael J. Kearns, Yishay Mansour, Andrew Y. Ng

AAAI
1997

119 views | Intelligent Agents

Structured Solution Methods for Non-Markovian Decision Processes

15 years 3 months ago

Download www.cs.toronto.edu

Markov Decision Processes (MDPs), currently a popular method for modeling and solving decision-theoretic planning problems, are limited by the Markovian assumption: rewards and dy...

Fahiem Bacchus, Craig Boutilier, Adam J. Grove
