Search Sciweavers | Sciweavers

1118 search results - page 55 / 224

» Relational temporal difference learning

186

click to vote

AUSAI
2008
Springer

185views Artificial Intelligence» more AUSAI 2008»

Character Recognition Using Hierarchical Vector Quantization and Temporal Pooling

15 years 8 months ago

Download www98.griffith.edu.au

In recent years, there has been a cross-fertilization of ideas between computational neuroscience models of the operation of the neocortex and artificial intelligence models of mac...

John Thornton, Jolon Faichney, Michael Blumenstein...

claim paper

Read More »

125

click to vote

CG
2002
Springer

96views Computer Graphics» more CG 2002»

Learning a Game Strategy Using Pattern-Weights and Self-play

15 years 6 months ago

Download www.arishapiro.com

Abstract. This paper demonstrates the use of pattern-weights in order to develop a strategy for an automated player of a non-cooperative version of the game of Diplomacy. Diplomacy...

Ari Shapiro, Gil Fuchs, Robert Levinson

claim paper

Read More »

167

Voted

CG
2006
Springer

155views Computer Graphics» more CG 2006»

Feature Construction for Reinforcement Learning in Hearts

15 years 8 months ago

Download webdocs.cs.ualberta.ca

Temporal difference (TD) learning has been used to learn strong evaluation functions in a variety of two-player games. TD-gammon illustrated how the combination of game tree search...

Nathan R. Sturtevant, Adam M. White

claim paper

Read More »

144

click to vote

IJCAI
2007

223views Artificial Intelligence» more IJCAI 2007»

Relational Knowledge with Predictive State Representations

15 years 7 months ago

Download web.mit.edu

Most work on Predictive Representations of State (PSRs) has focused on learning and planning in unstructured domains (for example, those represented by ﬂat POMDPs). This paper e...

David Wingate, Vishal Soni, Britton Wolfe, Satinde...

claim paper

Read More »

176

click to vote

ICML
2004
IEEE

156views Machine Learning» more ICML 2004»

Learning to fly by combining reinforcement learning with behavioural cloning

16 years 7 months ago

Download ccc.inaoep.mx

Reinforcement learning deals with learning optimal or near optimal policies while interacting with the environment. Application domains with many continuous variables are difficul...

Eduardo F. Morales, Claude Sammut

claim paper

Read More »

« Prev « First page 55 / 224 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers