Search Sciweavers | Sciweavers

164 search results - page 31 / 33

» Self-Optimizing Memory Controllers: A Reinforcement Learning...

click to vote

IROS
2006
IEEE

126views Robotics» more IROS 2006»

A System for Robotic Heart Surgery that Learns to Tie Knots Using Recurrent Neural Networks

13 years 11 months ago

Download www.idsia.ch

Abstract— Tying suture knots is a time-consuming task performed frequently during Minimally Invasive Surgery (MIS). Automating this task could greatly reduce total surgery time f...

Hermann Georg Mayer, Faustino J. Gomez, Daan Wiers...

claim paper

Read More »

click to vote

ATAL
2009
Springer

170views Intelligent Agents» more ATAL 2009»

Bounded rationality via recursion

13 years 12 months ago

Download www.ifaamas.org

Current trends in model construction in the ﬁeld of agentbased computational economics base behavior of agents on either game theoretic procedures (e.g. belief learning, ﬁctit...

Maciej Latek, Robert L. Axtell, Bogumil Kaminski

claim paper

Read More »

click to vote

WOWMOM
2005
ACM

240views Multimedia» more WOWMOM 2005»

An Adaptive Routing Protocol for Ad Hoc Peer-to-Peer Networks

13 years 11 months ago

Download sixearch.org

Ad hoc networks represent a key factor in the evolution of wireless communications. These networks typically consist of equal nodes that communicate without central control, inter...

Luca Gatani, Giuseppe Lo Re, Salvatore Gaglio

claim paper

Read More »

click to vote

UAI
2008

242views Artificial Intelligence» more UAI 2008»

Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping

13 years 6 months ago

Download uai2008.cs.helsinki.fi

We consider the problem of efficiently learning optimal control policies and value functions over large state spaces in an online setting in which estimates must be available afte...

Richard S. Sutton, Csaba Szepesvári, Alborz...

claim paper

Read More »

click to vote

NIPS
1998

164views Information Technology» more NIPS 1998»

Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms

13 years 6 months ago

Download www.cis.upenn.edu

In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

« Prev « First page 31 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers