Search Sciweavers | Sciweavers

81 search results - page 5 / 17

» Chess Neighborhoods, Function Combination, and Reinforcement...

DICTA
2007

132 views, Applied Computing

Fuzzy Model Based Recognition of Handwritten Hindi Characters

13 years 6 months ago

Download eprints.qut.edu.au

This paper presents the recognition of handwritten Hindi Characters based on the modified exponential membership function fitted to the fuzzy sets derived from features consisting...

Madasu Hanmandlu, O. V. Ramana Murthy, Vamsi Krish...

IJCAI
2007

254 views, Artificial Intelligence

Bayesian Inverse Reinforcement Learning

13 years 6 months ago

Download www.ijcai.org

Inverse Reinforcement Learning (IRL) is the problem of learning the reward function underlying a Markov Decision Process given the dynamics of the system and the behaviour of an e...

Deepak Ramachandran, Eyal Amir

NIPS
1993

123 views, Information Technology

Temporal Difference Learning of Position Evaluation in the Game of Go

13 years 6 months ago

Download www.gatsby.ucl.ac.uk

The game of Go has a high branching factor that defeats the tree search approach used in computer chess, and long-range spatiotemporal interactions that make position evaluation e...

Nicol N. Schraudolph, Peter Dayan, Terrence J. Sej...

ECML
2004
Springer

112 views, Machine Learning

Convergence and Divergence in Standard and Averaging Reinforcement Learning

13 years 10 months ago

Download igitur-archive.library.uu.nl

Although tabular reinforcement learning (RL) methods have been proved to converge to an optimal policy, the combination of particular conventional reinforcement learning techniques...

Marco Wiering

CG
2006
Springer

155 views, Computer Graphics

Feature Construction for Reinforcement Learning in Hearts

13 years 7 months ago

Download webdocs.cs.ualberta.ca

Temporal difference (TD) learning has been used to learn strong evaluation functions in a variety of two-player games. TD-gammon illustrated how the combination of game tree search...

Nathan R. Sturtevant, Adam M. White
