Search Sciweavers | Sciweavers

9 search results - page 1 / 2

» Monte-Carlo Go Reinforcement Learning Experiments

click to vote

CIG
2006
IEEE

190views Applied Computing» more CIG 2006»

Monte-Carlo Go Reinforcement Learning Experiments

13 years 10 months ago

Download www.math-info.univ-paris5.fr

Abstract— This paper describes experiments using reinforcement learning techniques to compute pattern urgencies used during simulations performed in a Monte-Carlo Go architecture...

Bruno Bouzy, Guillaume Chaslot

claim paper

Read More »

click to vote

ICML
2009
IEEE

131views Machine Learning» more ICML 2009»

Monte-Carlo simulation balancing

14 years 5 months ago

Download www.cs.ualberta.ca

In this paper we introduce the first algorithms for efficiently learning a simulation policy for Monte-Carlo search. Our main idea is to optimise the balance of a simulation polic...

David Silver, Gerald Tesauro

claim paper

Read More »

click to vote

ICML
2000
IEEE

153views Machine Learning» more ICML 2000»

Eligibility Traces for Off-Policy Policy Evaluation

14 years 5 months ago

Download www.cs.ualberta.ca

Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...

Doina Precup, Richard S. Sutton, Satinder P. Singh

claim paper

Read More »

click to vote

ICRA
2009
IEEE

138views Robotics» more ICRA 2009»

Which landmark is useful? Learning selection policies for navigation in unknown environments

13 years 11 months ago

Download europa.informatik.uni-freiburg.de

Abstract— In general, a mobile robot that operates in unknown environments has to maintain a map and has to determine its own location given the map. This introduces signiﬁcant...

Hauke Strasdat, Cyrill Stachniss, Wolfram Burgard

claim paper

Read More »

click to vote

ACG
2003
Springer

157views Computer Graphics» more ACG 2003»

Evaluation in Go by a Neural Network using Soft Segmentation

13 years 10 months ago

Download webdocs.cs.ualberta.ca

In this article a neural network architecture is presented that is able to build a soft segmentation of a two-dimensional input. This network architecture is applied to position ev...

Markus Enzenberger

claim paper

Read More »

« Prev « First page 1 / 2 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers