Search Sciweavers | Sciweavers

1227 search results - page 41 / 246

» Learning Rates for Q-Learning

click to vote

ML
2000
ACM

126views Machine Learning» more ML 2000»

Learning to Play Chess Using Temporal Differences

14 years 9 months ago

Download www.cs.princeton.edu

In this paper we present TDLEAF( ), a variation on the TD( ) algorithm that enables it to be used in conjunction with game-tree search. We present some experiments in which our che...

Jonathan Baxter, Andrew Tridgell, Lex Weaver

claim paper

Read More »

click to vote

TIT
2008

66views more TIT 2008»

State Amplification

14 years 9 months ago

Download www.stanford.edu

We consider the problem of transmitting data at rate over a state-dependent channel with state information available at the sender and at the same time conveying the information ab...

Young-Han Kim, Arak Sutivong, Thomas M. Cover

claim paper

Read More »

110

click to vote

HRI
2009
ACM

203views Human Computer Interaction» more HRI 2009»

Evaluating the ICRA 2008 HRI challenge

14 years 7 months ago

Download robot-at-cwe.eu

This paper reports on the evaluation of the ICRA 2008 Human-Robot Interaction (HRI) Challenge. Five research groups demonstrated state-of-the-art work on HRI with a special focus ...

Astrid Weiss, Thomas Scherndl, Manfred Tscheligi, ...

claim paper

Read More »

102

click to vote

CVPR
2010
IEEE

296views Computer Vision» more CVPR 2010»

Online-Batch Strongly Convex Multi Kernel Learning

15 years 6 months ago

Download francesco.orabona.com

Several object categorization algorithms use kernel methods over multiple cues, as they offer a principled approach to combine multiple cues, and to obtain state-of-theart perform...

Francesco Orabona, Jie Luo, Barbara Caputo

claim paper

Read More »

Voted

GLOBECOM
2006
IEEE

160views Communications» more GLOBECOM 2006»

Adaptive Learning of Transmission Control Policies for MIMO Fading Channels under Delay Constraint

15 years 3 months ago

Download www.ece.ubc.ca

— This paper addresses learning based adaptive resource allocation for wireless MIMO channels with Markovian fading. The problem is posed as Constrained Markov Decision Process w...

Dejan V. Djonin, Vikram Krishnamurthy

claim paper

Read More »

« Prev « First page 41 / 246 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers