Search Sciweavers | Sciweavers

1138 search results - page 74 / 228

» Feature Markov Decision Processes


AAAI
2010

Intelligent Agents

Compressing POMDPs Using Locality Preserving Non-Negative Matrix Factorization


Download www.cs.umass.edu

Partially Observable Markov Decision Processes (POMDPs) are a well-established and rigorous framework for sequential decision-making under uncertainty. POMDPs are well known to be...

Georgios Theocharous, Sridhar Mahadevan


ICMLA
2008

Machine Learning

Prediction-Directed Compression of POMDPs


Download damas.ift.ulaval.ca

The high dimensionality of the belief space in Partially Observable Markov Decision Processes (POMDPs) is one of the major causes that severely restrict the applicability of this model. ...

Abdeslam Boularias, Masoumeh T. Izadi, Brahim Chai...


FLAIRS
2006

Artificial Intelligence

Stochastic Deliberation Scheduling using GSMDPs


Download www.aaai.org

We propose a new decision-theoretic approach for solving execution-time deliberation scheduling problems using recent advances in Generalized Semi-Markov Decision Processes (GSMDP...

Kurt D. Krebsbach


IJCAI
2001

Artificial Intelligence

Symbolic Dynamic Programming for First-Order MDPs


Download www.cs.toronto.edu

We present a dynamic programming approach for the solution of first-order Markov decision processes. This technique uses an MDP whose dynamics is represented in a variant of the ...

Craig Boutilier, Raymond Reiter, Bob Price


ML
2002
ACM

Machine Learning

Near-Optimal Reinforcement Learning in Polynomial Time


Download www.cis.upenn.edu

We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decisio...

Michael J. Kearns, Satinder P. Singh
