Search Sciweavers | Sciweavers

162 search results - page 21 / 33

» Topological Value Iteration Algorithm for Markov Decision Pr...

119

click to vote

ICASSP
2009
IEEE

125views Signal Processing» more ICASSP 2009»

Fast belief propagation process element for high-quality stereo estimation

15 years 1 months ago

Download mpac.ee.ntu.edu.tw

Belief propagation is a popular global optimization technique for many computer vision problems. However, it requires extensive computation due to the iterative message passing op...

Chao-Chung Cheng, Chia-Kai Liang, Yen-Chieh Lai, H...

claim paper

Read More »

109

click to vote

CDC
2008
IEEE

118views Control Systems» more CDC 2008»

A density projection approach to dimension reduction for continuous-state POMDPs

15 years 9 months ago

Download netfiles.uiuc.edu

Abstract— Research on numerical solution methods for partially observable Markov decision processes (POMDPs) has primarily focused on discrete-state models, and these algorithms ...

Enlu Zhou, Michael C. Fu, Steven I. Marcus

claim paper

Read More »

139

click to vote

CONCUR
2006
Springer

159views Distributed And Parallel Com...» more CONCUR 2006»

Strategy Improvement for Stochastic Rabin and Streett Games

15 years 7 months ago

Download mtc.epfl.ch

A stochastic graph game is played by two players on a game graph with probabilistic transitions. We consider stochastic graph games with -regular winning conditions specified as Ra...

Krishnendu Chatterjee, Thomas A. Henzinger

claim paper

Read More »

133

click to vote

ATAL
2006
Springer

157views Intelligent Agents» more ATAL 2006»

Decentralized planning under uncertainty for teams of communicating agents

15 years 7 months ago

Download www.cs.cmu.edu

Decentralized partially observable Markov decision processes (DEC-POMDPs) form a general framework for planning for groups of cooperating agents that inhabit a stochastic and part...

Matthijs T. J. Spaan, Geoffrey J. Gordon, Nikos A....

claim paper

Read More »

122

click to vote

NIPS
2000

127views Information Technology» more NIPS 2000»

Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task

15 years 4 months ago

Download members.chello.at

The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...

Brian Sallans, Geoffrey E. Hinton

claim paper

Read More »

« Prev « First page 21 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers