Sciweavers

Search results for "Universal Reinforcement Learning" (2011 results, page 182 of 403)
ML
2006
ACM
15 years 4 months ago
Universal parameter optimisation in games based on SPSA
Most game programs have a large number of parameters that are crucial for their performance. While tuning these parameters by hand is rather difficult, efficient and easy to use ge...
Levente Kocsis, Csaba Szepesvári
WWW
2004
ACM
16 years 5 months ago
Integrating learning objects into an open learning environment: evaluation of learning processes in an informatics learning lab
The Didactics of Informatics research group at the University of Paderborn is involved in efforts to design, implement, and evaluate a web-based learning laboratory for informatics ...
Johannes Magenheim, Olaf Scheel
NIPS
1996
15 years 6 months ago
Why did TD-Gammon Work?
Although TD-Gammon is one of the major successes in machine learning, it has not led to similar impressive breakthroughs in temporal difference learning for other applications or ...
Jordan B. Pollack, Alan D. Blair
GECCO
2008
Springer
15 years 6 months ago
Self-adaptive constructivism in Neural XCS and XCSF
For artificial entities to achieve high degrees of autonomy they will need to display appropriate adaptability. In this sense adaptability includes representational flexibility gu...
Gerard David Howard, Larry Bull, Pier Luca Lanzi
ICML
2010
IEEE
15 years 6 months ago
Finite-Sample Analysis of LSTD
In this paper we consider the problem of policy evaluation in reinforcement learning, i.e., learning the value function of a fixed policy, using the least-squares temporal-differe...
Alessandro Lazaric, Mohammad Ghavamzadeh, Ré...