Sciweavers

170 search results - page 23 / 34
» Learning to play Tetris applying reinforcement learning meth...
Sort
View
EXPERT
2008
134views more  EXPERT 2008»
14 years 9 months ago
Learning to Tag and Tagging to Learn: A Case Study on Wikipedia
Natural language technologies have been long envisioned to play a crucial role in transitioning from the current Web to a more "semantic" Web. If anything, the significa...
Peter Mika, Massimiliano Ciaramita, Hugo Zaragoza,...
73
Voted
IJCNN
2006
IEEE
15 years 3 months ago
An Adaptive Penalty-Based Learning Extension for Backpropagation and its Variants
Abstract— Over the years, many improvements and refinements of the backpropagation learning algorithm have been reported. In this paper, a new adaptive penalty-based learning ex...
Boris Jansen, Kenji Nakayama
CIG
2006
IEEE
15 years 3 months ago
Temporal Difference Learning Versus Co-Evolution for Acquiring Othello Position Evaluation
Abstract— This paper compares the use of temporal difference learning (TDL) versus co-evolutionary learning (CEL) for acquiring position evaluation functions for the game of Othe...
Simon M. Lucas, Thomas Philip Runarsson
ROBOCUP
2000
Springer
130views Robotics» more  ROBOCUP 2000»
15 years 1 months ago
Improvement Continuous Valued Q-learning and Its Application to Vision Guided Behavior Acquisition
Q-learning, a most widely used reinforcement learning method, normally needs well-defined quantized state and action spaces to converge. This makes it difficult to be applied to re...
Yasutake Takahashi, Masanori Takeda, Minoru Asada
ICML
2008
IEEE
15 years 10 months ago
Sample-based learning and search with permanent and transient memories
We present a reinforcement learning architecture, Dyna-2, that encompasses both samplebased learning and sample-based search, and that generalises across states during both learni...
David Silver, Martin Müller 0003, Richard S. ...