Search Sciweavers | Sciweavers

3084 search results - page 127 / 617

» Learning to Take Actions

click to vote

IADIS
2003

114views Internet Technology» more IADIS 2003»

E-Blended Learning for Distance Learners

14 years 11 months ago

Download www.iadis.net

E-blended learning as a new methodology will be explained. E-blended learning scenario for distance learners will include live sessions. During the last years we developed e-learn...

Jeanne Schreurs

claim paper

Read More »

click to vote

ICML
2010
IEEE

231views Machine Learning» more ICML 2010»

Toward Off-Policy Learning Control with Function Approximation

14 years 11 months ago

Download www.sztaki.hu

We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...

Hamid Reza Maei, Csaba Szepesvári, Shalabh ...

claim paper

Read More »

112

click to vote

ICML
2000
IEEE

153views Machine Learning» more ICML 2000»

Eligibility Traces for Off-Policy Policy Evaluation

15 years 10 months ago

Download www.cs.ualberta.ca

Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...

Doina Precup, Richard S. Sutton, Satinder P. Singh

claim paper

Read More »

click to vote

IUI
1997
ACM

112views Software Engineering» more IUI 1997»

Inductive Task Modeling for User Interface Customization

15 years 2 months ago

Download www.iuiconf.org

This paper describes ActionStreams, a system for inducing task models from observations of user activity. The model can represent several task structures: hierarchy, variable sequ...

David Maulsby

claim paper

Read More »

click to vote

ICML
2003
IEEE

151views Machine Learning» more ICML 2003»

Hierarchical Policy Gradient Algorithms

15 years 10 months ago

Download www.hpl.hp.com

Hierarchical reinforcement learning is a general framework which attempts to accelerate policy learning in large domains. On the other hand, policy gradient reinforcement learning...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

« Prev « First page 127 / 617 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers