Search Sciweavers | Sciweavers

288 search results - page 18 / 58

» Learning to Play Chess Using Temporal Differences

ICANN
2010
Springer

157 views · Neural Networks

Learning in a Unitary Coherent Hippocampus

14 years 7 months ago

Download mushika.shef.ac.uk

Abstract. A previous paper [2] presented a model (UCPF-HC) of the hippocampus as a unitary coherent particle filter, which combines the classical hippocampal roles of associative m...

Charles W. Fox, Tony J. Prescott

CG
2002
Springer

96 views · Computer Graphics

Learning a Game Strategy Using Pattern-Weights and Self-play

14 years 9 months ago

Download www.arishapiro.com

Abstract. This paper demonstrates the use of pattern-weights in order to develop a strategy for an automated player of a non-cooperative version of the game of Diplomacy. Diplomacy...

Ari Shapiro, Gil Fuchs, Robert Levinson

NIPS
2001

206 views · Information Technology

Model-Free Least-Squares Policy Iteration

14 years 11 months ago

Download www.cs.duke.edu

We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...

Michail G. Lagoudakis, Ronald Parr

ICML
2009
IEEE

143 views · Machine Learning

Proto-predictive representation of states with simple recurrent temporal-difference networks

15 years 10 months ago

Download www.snowelm.com

We propose a new neural network architecture, called Simple Recurrent Temporal-Difference Networks (SR-TDNs), that learns to predict future observations in partially observable en...

Takaki Makino

ROMAN
2007
IEEE

191 views · Robotics

Learning and Recognition of Object Manipulation Actions Using Linear and Nonlinear Dimensionality Reduction

15 years 3 months ago

Download www.csc.kth.se

Abstract. In this work, we perform an extensive statistical evaluation for learning and recognition of object manipulation actions. We concentrate on single arm/hand actions but study th...

Isabel Serrano Vicente, Danica Kragic, Jan-Olof Ek...
