Search Sciweavers | Sciweavers

9841 search results - page 286 / 1969

» Distributed Value Functions

ICML
2001
IEEE

145 views, Machine Learning

Symmetry in Markov Decision Processes and its Implications for Single Agent and Multiagent Learning

16 years 7 months ago

Download www-2.cs.cmu.edu

This paper examines the notion of symmetry in Markov decision processes (MDPs). We define symmetry for an MDP and show how it can be exploited for more effective learning in singl...

Martin Zinkevich, Tucker R. Balch

ICML
1998
IEEE

268 views, Machine Learning

The MAXQ Method for Hierarchical Reinforcement Learning

16 years 7 months ago

Download www.cs.ualberta.ca

This paper presents a new approach to hierarchical reinforcement learning based on the MAXQ decomposition of the value function. The MAXQ decomposition has both a procedural seman...

Thomas G. Dietterich

ATAL
2005
Springer

130 views, Intelligent Agents

Behavior transfer for value-function-based reinforcement learning

16 years 23 hours ago

Download www.cs.huji.ac.il

Temporal difference (TD) learning methods [22] have become popular reinforcement learning techniques in recent years. TD methods have had some experimental successes and have been...

Matthew E. Taylor, Peter Stone

STOC
2000
ACM

134 views, Algorithms

Computing the median with uncertainty

15 years 10 months ago

Download www.cs.cmu.edu

We consider a new model for computing with uncertainty. It is desired to compute a function f(X1, ..., Xn) where X1, ..., Xn are unknown, but guaranteed to lie in specified i...

Tomás Feder, Rajeev Motwani, Rina Panigrahy...

HIM
1997
Springer

162 views, Multimedia

Probabilistic Logical Information Retrieval for Content, Hypertext, and Database Querying

15 years 10 months ago

Download lrb.cs.uni-dortmund.de

Classical retrieval models support content-oriented searching for documents using a set of words as data model. However, in hypertext and database applications we want to consider...

Thomas Rölleke, Markus Blömer
