Search Sciweavers | Sciweavers

162 search results - page 8 / 33

» Topological Value Iteration Algorithm for Markov Decision Pr...


ICRA
2007
IEEE


Oracular Partially Observable Markov Decision Processes: A Very Special Case

15 years 9 months ago


We introduce the Oracular Partially Observable Markov Decision Process (OPOMDP), a type of POMDP in which the world produces no observations; instead there is an “oracle,” ...

Nicholas Armstrong-Crews, Manuela M. Veloso


UAI
2000


Fast Planning in Stochastic Games

15 years 4 months ago


Stochastic games generalize Markov decision processes (MDPs) to a multiagent setting by allowing the state transitions to depend jointly on all player actions, and having rewards de...

Michael J. Kearns, Yishay Mansour, Satinder P. Sin...


JMLR
2006


Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation

15 years 3 months ago


We study a sequential variance reduction technique for Monte Carlo estimation of functionals in Markov Chains. The method is based on designing sequential control variates using s...

Rémi Munos


NIPS
2000


APRICODD: Approximate Policy Construction Using Decision Diagrams

15 years 4 months ago


We propose a method of approximate dynamic programming for Markov decision processes (MDPs) using algebraic decision diagrams (ADDs). We produce near-optimal value functions and p...

Robert St-Aubin, Jesse Hoey, Craig Boutilier


CORR
2011
Springer


Adaptive Channel Recommendation for Dynamic Spectrum Access

14 years 10 months ago


We propose a dynamic spectrum access scheme where secondary users recommend “good” channels to each other and access accordingly. We formulate the problem as an average rewa...

Xu Chen, Jianwei Huang, Husheng Li
