Sciweavers

170 search results - page 23 / 34
» Learning to play Tetris applying reinforcement learning meth...
Sort
View
109
Voted
EXPERT
2008
134views more  EXPERT 2008»
15 years 13 days ago
Learning to Tag and Tagging to Learn: A Case Study on Wikipedia
Natural language technologies have been long envisioned to play a crucial role in transitioning from the current Web to a more "semantic" Web. If anything, the significa...
Peter Mika, Massimiliano Ciaramita, Hugo Zaragoza,...
86
Voted
IJCNN
2006
IEEE
15 years 6 months ago
An Adaptive Penalty-Based Learning Extension for Backpropagation and its Variants
Abstract— Over the years, many improvements and refinements of the backpropagation learning algorithm have been reported. In this paper, a new adaptive penalty-based learning ex...
Boris Jansen, Kenji Nakayama
128
Voted
CIG
2006
IEEE
15 years 6 months ago
Temporal Difference Learning Versus Co-Evolution for Acquiring Othello Position Evaluation
Abstract— This paper compares the use of temporal difference learning (TDL) versus co-evolutionary learning (CEL) for acquiring position evaluation functions for the game of Othe...
Simon M. Lucas, Thomas Philip Runarsson
ROBOCUP
2000
Springer
130views Robotics» more  ROBOCUP 2000»
15 years 4 months ago
Improvement Continuous Valued Q-learning and Its Application to Vision Guided Behavior Acquisition
Q-learning, a most widely used reinforcement learning method, normally needs well-defined quantized state and action spaces to converge. This makes it difficult to be applied to re...
Yasutake Takahashi, Masanori Takeda, Minoru Asada
103
Voted
ICML
2008
IEEE
16 years 1 months ago
Sample-based learning and search with permanent and transient memories
We present a reinforcement learning architecture, Dyna-2, that encompasses both samplebased learning and sample-based search, and that generalises across states during both learni...
David Silver, Martin Müller 0003, Richard S. ...