Search Sciweavers | Sciweavers

52 search results - page 6 / 11

» Error Bounds for Approximate Policy Iteration

CDC
2009
IEEE

Control Systems

On the myopic policy for a class of restless bandit problems with applications in dynamic multichannel access

15 years 10 months ago

Download www.ece.ucdavis.edu

We consider a class of restless multi-armed bandit problems that arises in multi-channel opportunistic communications, where channels are modeled as independent and stochastically...

Keqin Liu, Qing Zhao

CORR
2008
Springer

Education

Dynamic Rate Allocation in Fading Multiple-access Channels

15 years 6 months ago

Download web.mit.edu

We consider the problem of rate allocation in a fading Gaussian multiple-access channel (MAC) with fixed transmission powers. Our goal is to maximize a general concave utility func...

Ali ParandehGheibi, Atilla Eryilmaz, Asuman E. Ozd...

ICMLA
2010

Machine Learning

Ensembles of Neural Networks for Robust Reinforcement Learning

15 years 3 months ago

Download ahans.de

Reinforcement learning algorithms that employ neural networks as function approximators have proven to be powerful tools for solving optimal control problems. However, their traini...

Alexander Hans, Steffen Udluft

CDC
2008
IEEE

Control Systems

Approximate abstractions of discrete-time controlled stochastic hybrid systems

16 years 10 days ago

Download hybrid.stanford.edu

This work proposes a procedure to c...

Alessandro D'Innocenzo, Alessandro Abate, Maria Do...

MP
2002

UOBYQA: unconstrained optimization by quadratic approximation

15 years 5 months ago

Download www.ii.uib.no

UOBYQA is a new algorithm for general unconstrained optimization calculations, that takes account of the curvature of the objective function, F say, by forming quadratic models by ...

M. J. D. Powell
