Search Sciweavers | Sciweavers

170 search results - page 23 / 34

» Learning to play Tetris applying reinforcement learning meth...


EXPERT
2008

134 views

Learning to Tag and Tagging to Learn: A Case Study on Wikipedia

15 years 13 days ago

Download research.yahoo.com

Natural language technologies have been long envisioned to play a crucial role in transitioning from the current Web to a more "semantic" Web. If anything, the significa...

Peter Mika, Massimiliano Ciaramita, Hugo Zaragoza,...


IJCNN
2006
IEEE

126 views, Neural Networks

An Adaptive Penalty-Based Learning Extension for Backpropagation and its Variants

15 years 6 months ago

Download leo.ec.t.kanazawa-u.ac.jp

Abstract: Over the years, many improvements and refinements of the backpropagation learning algorithm have been reported. In this paper, a new adaptive penalty-based learning ex...

Boris Jansen, Kenji Nakayama


CIG
2006
IEEE

202 views, Applied Computing

Temporal Difference Learning Versus Co-Evolution for Acquiring Othello Position Evaluation

15 years 6 months ago

Download algoval.essex.ac.uk

Abstract: This paper compares the use of temporal difference learning (TDL) versus co-evolutionary learning (CEL) for acquiring position evaluation functions for the game of Othe...

Simon M. Lucas, Thomas Philip Runarsson


ROBOCUP
2000
Springer

130 views, Robotics

Improvement Continuous Valued Q-learning and Its Application to Vision Guided Behavior Acquisition

15 years 4 months ago

Download www.er.ams.eng.osaka-u.ac.jp

Q-learning, a most widely used reinforcement learning method, normally needs well-defined quantized state and action spaces to converge. This makes it difficult to be applied to re...

Yasutake Takahashi, Masanori Takeda, Minoru Asada


ICML
2008
IEEE

117 views, Machine Learning

Sample-based learning and search with permanent and transient memories

16 years 1 months ago

Download www.cs.ualberta.ca

We present a reinforcement learning architecture, Dyna-2, that encompasses both sample-based learning and sample-based search, and that generalises across states during both learni...

David Silver, Martin Müller, Richard S. ...
