Search Sciweavers | Sciweavers

46 search results - page 3 / 10

» A Sparse Sampling Algorithm for Near-Optimal Planning in Lar...

click to vote

ICML
2010
IEEE

223views Machine Learning» more ICML 2010»

Feature Selection Using Regularization in Approximate Linear Programs for Markov Decision Processes

13 years 6 months ago

Download anytime.cs.umass.edu

Approximate dynamic programming has been used successfully in a large variety of domains, but it relies on a small set of provided approximation features to calculate solutions re...

Marek Petrik, Gavin Taylor, Ronald Parr, Shlomo Zi...

claim paper

Read More »

click to vote

ECML
2005
Springer

143views Machine Learning» more ECML 2005»

Active Learning in Partially Observable Markov Decision Processes

13 years 11 months ago

Download www.cs.mcgill.ca

This paper examines the problem of ﬁnding an optimal policy for a Partially Observable Markov Decision Process (POMDP) when the model is not known or is only poorly speciﬁed. W...

Robin Jaulmes, Joelle Pineau, Doina Precup

claim paper

Read More »

click to vote

UAI
2000

136views Artificial Intelligence» more UAI 2000»

Fast Planning in Stochastic Games

13 years 6 months ago

Download www.cis.upenn.edu

Stochastic games generalize Markov decision processes MDPs to a multiagent setting by allowing the state transitions to depend jointly on all player actions, and having rewards de...

Michael J. Kearns, Yishay Mansour, Satinder P. Sin...

claim paper

Read More »

click to vote

ICML
2003
IEEE

124views Machine Learning» more ICML 2003»

Exploration in Metric State Spaces

14 years 6 months ago

Download www.cis.upenn.edu

We present metric?? , a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows t...

Sham Kakade, Michael J. Kearns, John Langford

claim paper

Read More »

click to vote

AAAI
2011

145views Intelligent Agents» more AAAI 2011»

Policy Gradient Planning for Environmental Decision Making with Existing Simulators

12 years 5 months ago

Download www.cs.ubc.ca

In environmental and natural resource planning domains actions are taken at a large number of locations over multiple time periods. These problems have enormous state and action s...

Mark Crowley, David Poole

claim paper

Read More »

« Prev « First page 3 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers