Search Sciweavers | Sciweavers

698 search results - page 90 / 140

» A Deterministic Algorithm for Solving Imprecise Decision Pro...

133

click to vote

GECCO
2006
Springer

208views Optimization» more GECCO 2006»

Comparing evolutionary and temporal difference methods in a reinforcement learning domain

15 years 7 months ago

Download www.cs.bham.ac.uk

Both genetic algorithms (GAs) and temporal difference (TD) methods have proven effective at solving reinforcement learning (RL) problems. However, since few rigorous empirical com...

Matthew E. Taylor, Shimon Whiteson, Peter Stone

claim paper

Read More »

117

click to vote

FLAIRS
1998

132views Artificial Intelligence» more FLAIRS 1998»

Analytical Design of Reinforcement Learning Tasks

15 years 4 months ago

Download www.aaai.org

Reinforcement learning (RL) problems constitute an important class of learning and control problems faced by artificial intelligence systems. In these problems, one is faced with ...

Robert E. Smith

claim paper

Read More »

Voted

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

Relational temporal difference learning

16 years 4 months ago

Download cll.stanford.edu

We introduce relational temporal difference learning as an effective approach to solving multi-agent Markov decision problems with large state spaces. Our algorithm uses temporal ...

Nima Asgharbeygi, David J. Stracuzzi, Pat Langley

claim paper

Read More »

111

click to vote

CDC
2008
IEEE

118views Control Systems» more CDC 2008»

A density projection approach to dimension reduction for continuous-state POMDPs

15 years 9 months ago

Download netfiles.uiuc.edu

Abstract— Research on numerical solution methods for partially observable Markov decision processes (POMDPs) has primarily focused on discrete-state models, and these algorithms ...

Enlu Zhou, Michael C. Fu, Steven I. Marcus

claim paper

Read More »

134

Voted

IJCAI
2001

185views Artificial Intelligence» more IJCAI 2001»

Symbolic Dynamic Programming for First-Order MDPs

15 years 4 months ago

Download www.cs.toronto.edu

We present a dynamic programming approach for the solution of first-order Markov decisions processes. This technique uses an MDP whose dynamics is represented in a variant of the ...

Craig Boutilier, Raymond Reiter, Bob Price

claim paper

Read More »

« Prev « First page 90 / 140 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers