Search Sciweavers | Sciweavers

664 search results - page 42 / 133

» Combining Reinforcement Learning with a Local Control Algori...

164

click to vote

SPAA
2003
ACM

168views Distributed And Parallel Com...» more SPAA 2003»

On local algorithms for topology control and routing in ad hoc networks

15 years 11 months ago

Download ce.sharif.ac.ir

An ad hoc network is a collection of wireless mobile hosts forming a temporary network without the aid of any ﬁxed infrastructure. Indeed, an important task of an ad hoc network...

Lujun Jia, Rajmohan Rajaraman, Christian Scheidele...

claim paper

Read More »

153

click to vote

NIPS
2004

92views Information Technology» more NIPS 2004»

Responding to Modalities with Different Latencies

15 years 7 months ago

Download books.nips.cc

Motor control depends on sensory feedback in multiple modalities with different latencies. In this paper we consider within the framework of reinforcement learning how different s...

Fredrik Bissmarck, Hiroyuki Nakahara, Kenji Doya, ...

claim paper

Read More »

161

Voted

PRL
1998

87views more PRL 1998»

Global and local neural network ensembles

15 years 6 months ago

Download arantxa.ii.uam.es

Surprisingly simple local learning algorithms are known to outperform many other global non-linear machines. Unfortunately, these algorithms are computationally costly. A means of...

A. Sierra, Carlos Santa Cruz

claim paper

Read More »

173

click to vote

ATAL
2010
Springer

146views Intelligent Agents» more ATAL 2010»

PAC-MDP learning with knowledge-based admissible models

15 years 6 months ago

Download www.aamas-conference.org

PAC-MDP algorithms approach the exploration-exploitation problem of reinforcement learning agents in an effective way which guarantees that with high probability, the algorithm pe...

Marek Grzes, Daniel Kudenko

claim paper

Read More »

178

click to vote

ICML
2010
IEEE

231views Machine Learning» more ICML 2010»

Toward Off-Policy Learning Control with Function Approximation

15 years 7 months ago

Download www.sztaki.hu

We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...

Hamid Reza Maei, Csaba Szepesvári, Shalabh ...

claim paper

Read More »

« Prev « First page 42 / 133 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers