Search Sciweavers | Sciweavers

1227 search results - page 4 / 246

» Learning Rates for Q-Learning

193

click to vote

CORR
2010
Springer

98views Education» more CORR 2010»

Scheduling with Rate Adaptation under Incomplete Knowledge of Channel/Estimator Statistics

15 years 4 months ago

Download www2.ece.ohio-state.edu

In time-varying wireless networks, the states of the communication channels are subject to random variations, and hence need to be estimated for efficient rate adaptation and sched...

Wenzhuo Ouyang, Sugumar Murugesan, Atilla Eryilmaz...

claim paper

Read More »

201

click to vote

JSAC
2011

159views more JSAC 2011»

An Anti-Jamming Stochastic Game for Cognitive Radio Networks

15 years 1 months ago

Download sig.umd.edu

—Various spectrum management schemes have been proposed in recent years to improve the spectrum utilization in cognitive radio networks. However, few of them have considered the ...

Beibei Wang, Yongle Wu, K. J. Ray Liu, T. Charles ...

claim paper

Read More »

159

click to vote

COLT
2001
Springer

84views Machine Learning» more COLT 2001»

Learning Rates for Q-Learning

15 years 11 months ago

Download www.ai.mit.edu

In this paper we derive convergence rates for Q-learning. We show an interesting relationship between the convergence rate and the learning rate used in Q-learning. For a polynomi...

Eyal Even-Dar, Yishay Mansour

claim paper

Read More »

228

click to vote

ECML
2006
Springer

148views Machine Learning» more ECML 2006»

Constant Rate Approximate Maximum Margin Algorithms

15 years 10 months ago

Download eprints.ecs.soton.ac.uk

We present a new class of perceptron-like algorithms with margin in which the "effective" learning rate, defined as the ratio of the learning rate to the length of the we...

Petroula Tsampouka, John Shawe-Taylor

claim paper

Read More »

249

click to vote

AI
2002
Springer

171views Artificial Intelligence» more AI 2002»

Multiagent learning using a variable learning rate

15 years 6 months ago

Download www.cs.cmu.edu

Learning to act in a multiagent environment is a difficult problem since the normal definition of an optimal policy no longer applies. The optimal policy at any moment depends on ...

Michael H. Bowling, Manuela M. Veloso

claim paper

Read More »

« Prev « First page 4 / 246 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers