Search Sciweavers | Sciweavers

2011 search results - page 159 / 403

» Universal Reinforcement Learning

151

click to vote

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

16 years 5 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

136

Voted

ATAL
2004
Springer

105views Intelligent Agents» more ATAL 2004»

Best-Response Multiagent Learning in Non-Stationary Environments

15 years 10 months ago

Download www.odu.edu

This paper investigates a relatively new direction in Multiagent Reinforcement Learning. Most multiagent learning techniques focus on Nash equilibria as elements of both the learn...

Michael Weinberg, Jeffrey S. Rosenschein

claim paper

Read More »

124

click to vote

ICALT
2003
IEEE

86views Machine Learning» more ICALT 2003»

New Approaches to Media-Supported Project Work at the University Level

15 years 9 months ago

Download www.uni-siegen.de

We present experiences made with a course in applied computer science which was based on the concept of communities of practice. Within the scope of the course “Entrepreneurship...

Ralf Klamma, Matthias Jarke, Markus Rohde, Volker ...

claim paper

Read More »

134

click to vote

CSREAEEE
2006

141views Business» more CSREAEEE 2006»

Integrating the Learning Management System with other Online Administrative Systems at AOU

15 years 6 months ago

Download ww1.ucmss.com

- This paper follows the progress of improving the Arab Open University's Learning Management System by integrating it with other online systems, such as the university's...

Bayan Abu Shawar, Jehad Al-Sadi, Amr Hourani

claim paper

Read More »

152

click to vote

ICML
1994
IEEE

151views Machine Learning» more ICML 1994»

Learning Without State-Estimation in Partially Observable Markovian Decision Processes

15 years 8 months ago

Download www.eecs.umich.edu

Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning control architectures for embedded agents. Unfortunately all of the theory and much ...

Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...

claim paper

Read More »

« Prev « First page 159 / 403 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers