Search Sciweavers | Sciweavers

162 search results - page 18 / 33

» Off-Policy Temporal Difference Learning with Function Approx...

Voted

IJCAI
2007

173views Artificial Intelligence» more IJCAI 2007»

Reinforcement Learning of Local Shape in the Game of Go

15 years 1 months ago

Download webdocs.cs.ualberta.ca

We explore an application to the game of Go of a reinforcement learning approach based on a linear evaluation function and large numbers of binary features. This strategy has prov...

David Silver, Richard S. Sutton, Martin Mülle...

claim paper

Read More »

Voted

ICMLA
2007

92views Machine Learning» more ICMLA 2007»

Control of a re-entrant line manufacturing model with a reinforcement learning approach

15 years 1 months ago

Download www.smitlab.uc.edu

This paper presents the application of a reinforcement learning (RL) approach for the near-optimal control of a re-entrant line manufacturing (RLM) model. The RL approach utilizes...

José A. Ramírez-Hernández, Em...

claim paper

Read More »

Voted

ATAL
2009
Springer

135views Intelligent Agents» more ATAL 2009»

An empirical analysis of value function-based and policy search reinforcement learning

15 years 6 months ago

Download userweb.cs.utexas.edu

In several agent-oriented scenarios in the real world, an autonomous agent that is situated in an unknown environment must learn through a process of trial and error to take actio...

Shivaram Kalyanakrishnan, Peter Stone

claim paper

Read More »

101

click to vote

ICML
2007
IEEE

180views Machine Learning» more ICML 2007»

Bayesian actor-critic algorithms

16 years 14 days ago

Download www.machinelearning.org

We1 present a new actor-critic learning model in which a Bayesian class of non-parametric critics, using Gaussian process temporal difference learning is used. Such critics model ...

Mohammad Ghavamzadeh, Yaakov Engel

claim paper

Read More »

Voted

ML
2007
ACM

106views Machine Learning» more ML 2007»

Surrogate maximization/minimization algorithms and extensions

14 years 11 months ago

Download www.cs.ust.hk

Abstract Surrogate maximization (or minimization) (SM) algorithms are a family of algorithms that can be regarded as a generalization of expectation-maximization (EM) algorithms. A...

Zhihua Zhang, James T. Kwok, Dit-Yan Yeung

claim paper

Read More »

« Prev « First page 18 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers