Search Sciweavers | Sciweavers

162 search results - page 10 / 33

» Off-Policy Temporal Difference Learning with Function Approx...

182

click to vote

IAT
2008
IEEE

161views Intelligent Agents» more IAT 2008»

Scaling Up Multi-agent Reinforcement Learning in Complex Domains

15 years 6 months ago

Download www3.ntu.edu.sg

TD-FALCON (Temporal Difference - Fusion Architecture for Learning, COgnition, and Navigation) is a class of self-organizing neural networks that incorporates Temporal Difference (...

Dan Xiao, Ah-Hwee Tan

claim paper

Read More »

164

click to vote

ML
1998
ACM

136views Machine Learning» more ML 1998»

Co-Evolution in the Successful Learning of Backgammon Strategy

15 years 6 months ago

Download www.demo.cs.brandeis.edu

Following Tesauro’s work on TD-Gammon, we used a 4000 parameter feed-forward neural network to develop a competitive backgammon evaluation function. Play proceeds by a roll of t...

Jordan B. Pollack, Alan D. Blair

claim paper

Read More »

162

click to vote

JMLR
2010

103views more JMLR 2010»

Learning Nonlinear Dynamic Models from Non-sequenced Data

15 years 1 months ago

Download www.cs.cmu.edu

Virtually all methods of learning dynamic systems from data start from the same basic assumption: the learning algorithm will be given a sequence of data generated from the dynami...

Tzu-Kuo Huang, Le Song, Jeff Schneider

claim paper

Read More »

198

click to vote

ICANN
2010
Springer

151views Neural Networks» more ICANN 2010»

Dynamics and Function of a CA1 Model of the Hippocampus during Theta and Ripples

15 years 4 months ago

Download people.bu.edu

The hippocampus is known to be involved in spatial learning in rats. Spatial learning involves the encoding and replay of temporally sequenced spatial information. Temporally seque...

Vassilis Cutsuridis, Michael E. Hasselmo

claim paper

Read More »

167

click to vote

NIPS
1996

134views Information Technology» more NIPS 1996»

Why did TD-Gammon Work?

15 years 7 months ago

Download www.cse.unsw.edu.au

Although TD-Gammon is one of the major successes in machine learning, it has not led to similar impressive breakthroughs in temporal difference learning for other applications or ...

Jordan B. Pollack, Alan D. Blair

claim paper

Read More »

« Prev « First page 10 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers