Sciweavers

2011 search results - page 183 / 403
» Universal Reinforcement Learning
Sort
View
ROBOCUP
2000
Springer
104views Robotics» more  ROBOCUP 2000»
15 years 8 months ago
Essex Wizards 2000 Team Description
: This article gives an overview of the Essex Wizards 2000 team participated in the RoboCup 2000 simulator league. A brief description of the agent architecture for the team is int...
Huosheng Hu, Kostas Kostiadis, Matthew Hunter, Kos...
138
Voted
ESANN
2008
15 years 6 months ago
Similarities and differences between policy gradient methods and evolution strategies
Natural policy gradient methods and the covariance matrix adaptation evolution strategy, two variable metric methods proposed for solving reinforcement learning tasks, are contrast...
Verena Heidrich-Meisner, Christian Igel
144
Voted
NIPS
2007
15 years 6 months ago
Stable Dual Dynamic Programming
Recently, we have introduced a novel approach to dynamic programming and reinforcement learning that is based on maintaining explicit representations of stationary distributions i...
Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...
CSEE
2003
Springer
15 years 10 months ago
A Coordinated Plan for Teaching Software Engineering in the Rey Juan Carlos University
Nowadays both industry and academic environments are showing a lot of interest in the Software Engineering discipline. Therefore, it is a challenge for universities to provide stu...
Jorge Enrique Pérez-Martínez, Almude...
132
Voted
SIGCSE
2008
ACM
132views Education» more  SIGCSE 2008»
15 years 4 months ago
A case study of retention practices at the University of Illinois at Urbana-Champaign
Computer science is seeing a decline in enrollment at all levels of education. One key strategy for reversing this decline is to improve methods of student retention. This paper, ...
Tanya L. Crenshaw, Erin W. Chambers, Heather Metca...