Search Sciweavers | Sciweavers

23 search results - page 5 / 5

» Percentile optimization in uncertain Markov decision process...

click to vote

JMLR
2010

140views more JMLR 2010»

Mean Field Variational Approximation for Continuous-Time Bayesian Networks

12 years 11 months ago

Download www.cs.mcgill.ca

Continuous-time Bayesian networks is a natural structured representation language for multicomponent stochastic processes that evolve continuously over time. Despite the compact r...

Ido Cohn, Tal El-Hay, Nir Friedman, Raz Kupferman

claim paper

Read More »

click to vote

AAAI
2010

185views Intelligent Agents» more AAAI 2010»

PUMA: Planning Under Uncertainty with Macro-Actions

13 years 6 months ago

Download www.cs.berkeley.edu

Planning in large, partially observable domains is challenging, especially when a long-horizon lookahead is necessary to obtain a good policy. Traditional POMDP planners that plan...

Ruijie He, Emma Brunskill, Nicholas Roy

claim paper

Read More »

click to vote

ICML
1999
IEEE

168views Machine Learning» more ICML 1999»

Least-Squares Temporal Difference Learning

14 years 5 months ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan

claim paper

Read More »

« Prev « First page 5 / 5 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers