Search Sciweavers | Sciweavers

176 search results - page 29 / 36

» On the Controller Synthesis for Finite-State Markov Decision...

NIPS
2001

144 views · Information Technology

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 1 month ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

IJCAI
2003

173 views · Artificial Intelligence

A Planning Algorithm for Predictive State Representations

15 years 1 month ago

Download dli.iiit.ac.in

We address the problem of optimally controlling stochastic environments that are partially observable. The standard method for tackling such problems is to define and solve a Part...

Masoumeh T. Izadi, Doina Precup

TMC
2011

219 views

Optimal Channel Access Management with QoS Support for Cognitive Vehicular Networks

14 years 6 months ago

Download www3.ntu.edu.sg

We consider the problem of optimal channel access to provide quality of service (QoS) for data transmission in cognitive vehicular networks. In such a network the vehicular nodes ...

Dusit Niyato, Ekram Hossain, Ping Wang

ICMLA
2009

181 views · Machine Learning

Sensitivity Analysis of POMDP Value Functions

14 years 9 months ago

Download www.cs.cmu.edu

In sequential decision making under uncertainty, as in many other modeling endeavors, researchers observe a dynamical system and collect data measuring its behavior over time. The...

Stéphane Ross, Masoumeh T. Izadi, Mark Merc...

ATAL
2010
Springer

128 views · Intelligent Agents

Approximate dynamic programming with affine ADDs

14 years 6 months ago

Download eprints.pascal-network.org

The Affine ADD (AADD) is an extension of the Algebraic Decision Diagram (ADD) that compactly represents context-specific, additive and multiplicative structure in functions from a...

Scott Sanner, William T. B. Uther, Karina Valdivia...
