Search Sciweavers | Sciweavers

4544 search results - page 33 / 909

» Reinforcement Learning with Time

189

click to vote

ICML
1998
IEEE

268views Machine Learning» more ICML 1998»

The MAXQ Method for Hierarchical Reinforcement Learning

16 years 8 months ago

Download www.cs.ualberta.ca

This paper presents a new approach to hierarchical reinforcement learning based on the MAXQ decomposition of the value function. The MAXQ decomposition has both a procedural seman...

Thomas G. Dietterich

claim paper

Read More »

225

click to vote

CIG
2005
IEEE

131views Applied Computing» more CIG 2005»

A Survey on Multiagent Reinforcement Learning Towards Multi-Robot Systems

15 years 9 months ago

Download cswww.essex.ac.uk

Abstract- Multiagent reinforcement learning for multirobot systems is a challenging issue in both robotics and artiﬁcial intelligence. With the ever increasing interests in theor...

Erfu Yang, Dongbing Gu

claim paper

Read More »

186

click to vote

AAAI
2010

171views Intelligent Agents» more AAAI 2010»

Reinforcement Learning via AIXI Approximation

15 years 9 months ago

Download jveness.info

This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. This approach is based on a direct approximation of AIXI, a Bayesian...

Joel Veness, Kee Siong Ng, Marcus Hutter, David Si...

claim paper

Read More »

241

click to vote

IJCAI
2007

179views Artificial Intelligence» more IJCAI 2007»

Heuristic Selection of Actions in Multiagent Reinforcement Learning

15 years 9 months ago

Download www.ijcai.org

This work presents a new algorithm, called Heuristically Accelerated Minimax-Q (HAMMQ), that allows the use of heuristics to speed up the wellknown Multiagent Reinforcement Learni...

Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna...

claim paper

Read More »

197

click to vote

AUSAI
2005
Springer

166views Artificial Intelligence» more AUSAI 2005»

Adaptive Utility-Based Scheduling in Resource-Constrained Systems

16 years 1 months ago

Download labs.oracle.com

This paper addresses the problem of scheduling jobs in soft real-time systems, where the utility of completing each job decreases over time. We present a utility-based framework fo...

David Vengerov

claim paper

Read More »

« Prev « First page 33 / 909 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers