Search Sciweavers | Sciweavers

495 search results - page 54 / 99

» Approximation algorithms for budgeted learning problems

click to vote

ICPR
2006
IEEE

260views computer vision» more ICPR 2006»

Control Double Inverted Pendulum by Reinforcement Learning with Double CMAC Network

15 years 10 months ago

Download ee2.chit.edu.tw

To accelerate the learning of reinforcement learning, many types of function approximation are used to represent state value. However function approximation reduces the accuracy o...

Siwei Luo, Yu Zheng, Ziang Lv

claim paper

Read More »

click to vote

JMLR
2006

153views more JMLR 2006»

Collaborative Multiagent Reinforcement Learning by Payoff Propagation

14 years 9 months ago

Download jmlr.csail.mit.edu

In this article we describe a set of scalable techniques for learning the behavior of a group of agents in a collaborative multiagent setting. As a basis we use the framework of c...

Jelle R. Kok, Nikos A. Vlassis

claim paper

Read More »

click to vote

NIPS
1994

90views Information Technology» more NIPS 1994»

Reinforcement Learning with Soft State Aggregation

14 years 11 months ago

Download www.eecs.umich.edu

It is widely accepted that the use of more compact representations than lookup tables is crucial to scaling reinforcement learning (RL) algorithms to real-world problems. Unfortun...

Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...

claim paper

Read More »

click to vote

ML
2007
ACM

106views Machine Learning» more ML 2007»

Surrogate maximization/minimization algorithms and extensions

14 years 9 months ago

Download www.cs.ust.hk

Abstract Surrogate maximization (or minimization) (SM) algorithms are a family of algorithms that can be regarded as a generalization of expectation-maximization (EM) algorithms. A...

Zhihua Zhang, James T. Kwok, Dit-Yan Yeung

claim paper

Read More »

107

click to vote

ECML
2007
Springer

167views Machine Learning» more ECML 2007»

Efficient Continuous-Time Reinforcement Learning with Adaptive State Graphs

15 years 1 months ago

Download www.igi.tugraz.at

Abstract. We present a new reinforcement learning approach for deterministic continuous control problems in environments with unknown, arbitrary reward functions. The difficulty of...

Gerhard Neumann, Michael Pfeiffer, Wolfgang Maass

claim paper

Read More »

« Prev « First page 54 / 99 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers