Search Sciweavers | Sciweavers

1325 search results - page 64 / 265

» Algorithm Selection using Reinforcement Learning


AIIDE
2008

146 views, Artificial Intelligence

Agent Learning using Action-Dependent Learning Rates in Computer Role-Playing Games

15 years 9 months ago

Download www.aaai.org

We introduce the ALeRT (Action-dependent Learning Rates with Trends) algorithm that makes two modifications to the learning rate and one change to the exploration rate of traditio...

Maria Cutumisu, Duane Szafron, Michael H. Bowling,...


ML
2008
ACM

152 views, Machine Learning

Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path

15 years 6 months ago

Download hal.inria.fr

Abstract. We consider batch reinforcement learning problems in continuous space, expected total discounted-reward Markovian Decision Problems. As opposed to previous theoretical wo...

András Antos, Csaba Szepesvári, Ré...


AGI
2008

142 views, Artificial Intelligence

Transfer Learning and Intelligence: an Argument and Approach

15 years 8 months ago

Download www.cs.utexas.edu

In order to claim fully general intelligence in an autonomous agent, the ability to learn is one of the most central capabilities. Classical machine learning techniques have had ma...

Matthew E. Taylor, Gregory Kuhlmann, Peter Stone


ECAI
2004
Springer

123 views, Artificial Intelligence

Learning Techniques for Automatic Algorithm Portfolio Selection

16 years 7 days ago

Download www-lia.deis.unibo.it

The purpose of this paper is to show that a well known machine learning technique based on Decision Trees can be effectively used to select the best approach (in terms of efficien...

Alessio Guerri, Michela Milano


ATAL
2008
Springer

123 views, Intelligent Agents

Sigma point policy iteration

15 years 8 months ago

Download web.mit.edu

In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...

Michael H. Bowling, Alborz Geramifard, David Winga...

