Search Sciweavers | Sciweavers

2011 search results - page 182 / 403

» Universal Reinforcement Learning

151

Voted

ML
2006
ACM

99views Machine Learning» more ML 2006»

Universal parameter optimisation in games based on SPSA

15 years 4 months ago

Download www.jhuapl.edu

Most game programs have a large number of parameters that are crucial for their performance. While tuning these parameters by hand is rather difficult, efficient and easy to use ge...

Levente Kocsis, Csaba Szepesvári

claim paper

Read More »

126

click to vote

WWW
2004
ACM

115views Internet Technology» more WWW 2004»

Integrating learning objects into an open learning environment: evaluation of learning processes in an informatics learning lab

16 years 5 months ago

Download www.iw3c2.org

The Didactics of Informatics research group at the University of Paderborn is involved in efforts to design implement and evaluate a web-based learning laboratory for informatics ...

Johannes Magenheim, Olaf Scheel

claim paper

Read More »

136

click to vote

NIPS
1996

134views Information Technology» more NIPS 1996»

Why did TD-Gammon Work?

15 years 6 months ago

Download www.cse.unsw.edu.au

Although TD-Gammon is one of the major successes in machine learning, it has not led to similar impressive breakthroughs in temporal difference learning for other applications or ...

Jordan B. Pollack, Alan D. Blair

claim paper

Read More »

178

click to vote

GECCO
2008
Springer

144views Optimization» more GECCO 2008»

Self-adaptive constructivism in Neural XCS and XCSF

15 years 6 months ago

Download www.cems.uwe.ac.uk

For artificial entities to achieve high degrees of autonomy they will need to display appropriate adaptability. In this sense adaptability includes representational flexibility gu...

Gerard David Howard, Larry Bull, Pier Luca Lanzi

claim paper

Read More »

134

click to vote

ICML
2010
IEEE

167views Machine Learning» more ICML 2010»

Finite-Sample Analysis of LSTD

15 years 6 months ago

Download hal.inria.fr

In this paper we consider the problem of policy evaluation in reinforcement learning, i.e., learning the value function of a fixed policy, using the least-squares temporal-differe...

Alessandro Lazaric, Mohammad Ghavamzadeh, Ré...

claim paper

Read More »

« Prev « First page 182 / 403 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers