Search Sciweavers | Sciweavers

162 search results - page 6 / 33

» Topological Value Iteration Algorithm for Markov Decision Pr...

AI
2006
Springer

167views Artificial Intelligence» more AI 2006»

Belief Selection in Point-Based Planning Algorithms for POMDPs

15 years 10 months ago

Download www.cs.mcgill.ca

Abstract. Current point-based planning algorithms for solving partially observable Markov decision processes (POMDPs) have demonstrated that a good approximation of the value funct...

Masoumeh T. Izadi, Doina Precup, Danielle Azar

NN
2010
Springer

187views Neural Networks» more NN 2010»

Efficient exploration through active learning for value function approximation in reinforcement learning

15 years 1 month ago

Download sugiyama-www.cs.titech.ac.jp

Appropriately designing sampling policies is highly important for obtaining better control policies in reinforcement learning. In this paper, we first show that the least-squares ...

Takayuki Akiyama, Hirotaka Hachiya, Masashi Sugiya...

IJCAI
2001

185views Artificial Intelligence» more IJCAI 2001»

Symbolic Dynamic Programming for First-Order MDPs

15 years 8 months ago

Download www.cs.toronto.edu

We present a dynamic programming approach for the solution of first-order Markov decision processes. This technique uses an MDP whose dynamics are represented in a variant of the ...

Craig Boutilier, Raymond Reiter, Bob Price

ATAL
2007
Springer

185views Intelligent Agents» more ATAL 2007»

On opportunistic techniques for solving decentralized Markov decision processes with temporal constraints

16 years 27 days ago

Download www.aamas-conference.org

Decentralized Markov Decision Processes (DEC-MDPs) are a popular model of agent-coordination problems in domains with uncertainty and time constraints but very difficult to solve...

Janusz Marecki, Milind Tambe

CAV
2007
Springer

112views Hardware» more CAV 2007»

Magnifying-Lens Abstraction for Markov Decision Processes

16 years 27 days ago

Download www.ee.ucla.edu

Magnifying-Lens Abstraction for Markov Decision Processes. In Proc. of CAV 2007: 19th International Conference on Computer-Aided Verification, Lecture Notes in Computer Science. © Spri...

Luca de Alfaro, Pritam Roy
