Search Sciweavers | Sciweavers

651 search results - page 101 / 131

» Algorithms for Inverse Reinforcement Learning

172


IBERAMIA
2004
Springer

168 views, Artificial Intelligence

Mobile Robotic Supported Collaborative Learning (MRSCL)

16 years 2 days ago

Download www2.ing.puc.cl

In this paper we describe MRSCL Geometry, a collaborative educational activity that explores the use of robotic technology and wirelessly connected Pocket PCs as tools for teaching ...

Rubén Mitnik, Miguel Nussbaum, Alvaro Soto


197


ICML
2010
IEEE

231 views, Machine Learning

Toward Off-Policy Learning Control with Function Approximation

15 years 7 months ago

Download www.sztaki.hu

We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...

Hamid Reza Maei, Csaba Szepesvári, Shalabh ...


205


ICML
1999
IEEE

152 views, Machine Learning

Distributed Value Functions

16 years 7 months ago

Download www.ri.cmu.edu

Many interesting problems, such as power grids, network switches, and traffic flow, that are candidates for solving with reinforcement learning (RL), also have properties that make distri...

Jeff G. Schneider, Weng-Keen Wong, Andrew W. Moore...


143


CACM
2010

105 views

Censored exploration and the dark pool problem

15 years 6 months ago

Download www.cis.upenn.edu

We introduce and analyze a natural algorithm for multi-venue exploration from censored data, which is motivated by the Dark Pool Problem of modern quantitative finance. We prove t...

Kuzman Ganchev, Yuriy Nevmyvaka, Michael Kearns, J...


176


NIPS
1993

128 views, Information Technology

Convergence of Stochastic Iterative Dynamic Programming Algorithms

15 years 8 months ago

Download www.bitsavers.org

Recent developments in the area of reinforcement learning have yielded a number of new algorithms for the prediction and control of Markovian environments. These algorithms, includ...

Tommi Jaakkola, Michael I. Jordan, Satinder P. Sin...


