Sciweavers

» Learning Rates for Q-Learning
ML
2000
ACM
Machine Learning
14 years 9 months ago
Learning to Play Chess Using Temporal Differences
In this paper we present TDLEAF(λ), a variation on the TD(λ) algorithm that enables it to be used in conjunction with game-tree search. We present some experiments in which our che...
Jonathan Baxter, Andrew Tridgell, Lex Weaver
TIT
2008
14 years 9 months ago
State Amplification
We consider the problem of transmitting data at rate R over a state-dependent channel with state information available at the sender and at the same time conveying the information ab...
Young-Han Kim, Arak Sutivong, Thomas M. Cover
HRI
2009
ACM
14 years 7 months ago
Evaluating the ICRA 2008 HRI challenge
This paper reports on the evaluation of the ICRA 2008 Human-Robot Interaction (HRI) Challenge. Five research groups demonstrated state-of-the-art work on HRI with a special focus ...
Astrid Weiss, Thomas Scherndl, Manfred Tscheligi, ...
CVPR
2010
IEEE
15 years 6 months ago
Online-Batch Strongly Convex Multi Kernel Learning
Several object categorization algorithms use kernel methods over multiple cues, as they offer a principled approach to combine multiple cues, and to obtain state-of-the-art perform...
Francesco Orabona, Jie Luo, Barbara Caputo
GLOBECOM
2006
IEEE
15 years 3 months ago
Adaptive Learning of Transmission Control Policies for MIMO Fading Channels under Delay Constraint
This paper addresses learning-based adaptive resource allocation for wireless MIMO channels with Markovian fading. The problem is posed as a Constrained Markov Decision Process w...
Dejan V. Djonin, Vikram Krishnamurthy