Sciweavers

2011 search results - page 159 / 403
» Universal Reinforcement Learning
Sort
View
ICML
2001
IEEE
16 years 5 months ago
Off-Policy Temporal Difference Learning with Function Approximation
We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...
Doina Precup, Richard S. Sutton, Sanjoy Dasgupta
136
Voted
ATAL
2004
Springer
15 years 10 months ago
Best-Response Multiagent Learning in Non-Stationary Environments
This paper investigates a relatively new direction in Multiagent Reinforcement Learning. Most multiagent learning techniques focus on Nash equilibria as elements of both the learn...
Michael Weinberg, Jeffrey S. Rosenschein
ICALT
2003
IEEE
15 years 9 months ago
New Approaches to Media-Supported Project Work at the University Level
We present experiences made with a course in applied computer science which was based on the concept of communities of practice. Within the scope of the course “Entrepreneurship...
Ralf Klamma, Matthias Jarke, Markus Rohde, Volker ...
CSREAEEE
2006
141views Business» more  CSREAEEE 2006»
15 years 6 months ago
Integrating the Learning Management System with other Online Administrative Systems at AOU
- This paper follows the progress of improving the Arab Open University's Learning Management System by integrating it with other online systems, such as the university's...
Bayan Abu Shawar, Jehad Al-Sadi, Amr Hourani
ICML
1994
IEEE
15 years 8 months ago
Learning Without State-Estimation in Partially Observable Markovian Decision Processes
Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning control architectures for embedded agents. Unfortunately all of the theory and much ...
Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...