Search Sciweavers | Sciweavers

102 search results - page 1 / 21

» MDPs with Non-Deterministic Policies

click to vote

NIPS
2008

171views Information Technology» more NIPS 2008»

MDPs with Non-Deterministic Policies

13 years 6 months ago

Download www.cs.mcgill.ca

Markov Decision Processes (MDPs) have been extensively studied and used in the context of planning and decision-making, and many methods exist to find the optimal policy for probl...

Mahdi Milani Fard, Joelle Pineau

claim paper

Read More »

click to vote

JAIR
2011

144views more JAIR 2011»

Non-Deterministic Policies in Markovian Decision Processes

12 years 11 months ago

Download www.jair.org

Markovian processes have long been used to model stochastic environments. Reinforcement learning has emerged as a framework to solve sequential planning and decision-making proble...

Mahdi Milani Fard, Joelle Pineau

claim paper

Read More »

click to vote

AIPS
2011

216views Artificial Intelligence» more AIPS 2011»

Heuristic Search for Generalized Stochastic Shortest Path MDPs

12 years 8 months ago

Download www.cs.washington.edu

Research in efﬁcient methods for solving inﬁnite-horizon MDPs has so far concentrated primarily on discounted MDPs and the more general stochastic shortest path problems (SSPs...

Andrey Kolobov, Mausam, Daniel S. Weld, Hector Gef...

claim paper

Read More »

click to vote

NIPS
2003

196views Information Technology» more NIPS 2003»

Approximate Policy Iteration with a Policy Language Bias

13 years 6 months ago

Download www.jair.org

We study an approach to policy selection for large relational Markov Decision Processes (MDPs). We consider a variant of approximate policy iteration (API) that replaces the usual...

Alan Fern, Sung Wook Yoon, Robert Givan

claim paper

Read More »

click to vote

AAAI
2010

136views Intelligent Agents» more AAAI 2010»

Robust Policy Computation in Reward-Uncertain MDPs Using Nondominated Policies

13 years 6 months ago

Download www.cs.toronto.edu

The precise specification of reward functions for Markov decision processes (MDPs) is often extremely difficult, motivating research into both reward elicitation and the robust so...

Kevin Regan, Craig Boutilier

claim paper

Read More »

« Prev « First page 1 / 21 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers