Search Sciweavers | Sciweavers

52 search results - page 4 / 11

» Error Bounds for Approximate Policy Iteration

203

click to vote

TIT
2010

115views Education» more TIT 2010»

On resource allocation in fading multiple-access channels-an efficient approximate projection approach

15 years 2 months ago

Download web.mit.edu

We consider the problem of rate and power allocation in a multiple-access channel. Our objective is to obtain rate and power allocation policies that maximize a general concave ut...

Ali ParandehGheibi, Atilla Eryilmaz, Asuman E. Ozd...

claim paper

Read More »

205

click to vote

ICML
2007
IEEE

139views Machine Learning» more ICML 2007»

Multi-armed bandit problems with dependent arms

16 years 8 months ago

Download www.cs.cmu.edu

We provide a framework to exploit dependencies among arms in multi-armed bandit problems, when the dependencies are in the form of a generative model on clusters of arms. We find ...

Sandeep Pandey, Deepayan Chakrabarti, Deepak Agarw...

claim paper

Read More »

195

Voted

JMLR
2006

143views more JMLR 2006»

Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation

15 years 7 months ago

Download www.aaai.org

We study a sequential variance reduction technique for Monte Carlo estimation of functionals in Markov Chains. The method is based on designing sequential control variates using s...

Rémi Munos

claim paper

Read More »

173

Voted

SIAMJO
2008

93views more SIAMJO 2008»

Smooth Optimization with Approximate Gradient

15 years 7 months ago

Download www.princeton.edu

We show that the optimal complexity of Nesterov's smooth first-order optimization algorithm is preserved when the gradient is only computed up to a small, uniformly bounded er...

Alexandre d'Aspremont

claim paper

Read More »

222

click to vote

AIPS
2010

174views Artificial Intelligence» more AIPS 2010»

When Policies Can Be Trusted: Analyzing a Criteria to Identify Optimal Policies in MDPs with Unknown Model Parameters

15 years 10 months ago

Download www.cs.berkeley.edu

Computing a good policy in stochastic uncertain environments with unknown dynamics and reward model parameters is a challenging task. In a number of domains, ranging from space ro...

Emma Brunskill

claim paper

Read More »

« Prev « First page 4 / 11 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers