A globally optimal algorithm for TTD-MDPs

In this paper, we discuss the use of Targeted Trajectory Distribution Markov Decision Processes (TTD-MDPs), a variant of MDPs in which the goal is to realize a specified distribution of trajectories through a state space, as a general agent-coordination framework. We present several advances to previous work on TTD-MDPs. We improve on the existing algorithm for solving TTD-MDPs by deriving a greedy algorithm that finds a policy that provably minimizes the global KL-divergence from the target distribution. We test the new algorithm by applying TTD-MDPs to drama management, where a system must coordinate the behavior of many agents to ensure that a game follows a coherent storyline, is in keeping with the author’s desires, and offers a high degree of replayability. Although we show that suboptimal greedy strategies will fail in some cases, we validate previous work that suggests that they can work well in practice. We also show that our new algorithm provides guaranteed accuracy e...
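To make the objective in the abstract concrete, the following is a minimal sketch of how the KL-divergence between a target trajectory distribution and the distribution induced by a stochastic policy could be measured on a toy problem. It is not the paper's algorithm. The function names, the state-indexed policy, the toy transition model, and the direction of the divergence (target relative to induced) are all assumptions made for illustration.

```python
import math


def induced_distribution(policy, transition, start, horizon):
    """Enumerate trajectories of `horizon` steps and their probabilities.

    policy:     {state: {action: prob}}                (hypothetical layout)
    transition: {(state, action): {next_state: prob}}  (hypothetical layout)
    """
    dist = {(start,): 1.0}
    for _ in range(horizon):
        extended = {}
        for traj, p in dist.items():
            state = traj[-1]
            for action, p_action in policy[state].items():
                for nxt, p_next in transition[(state, action)].items():
                    key = traj + (nxt,)
                    # Marginalize over actions that reach the same trajectory.
                    extended[key] = extended.get(key, 0.0) + p * p_action * p_next
        dist = extended
    return dist


def kl_divergence(target, induced):
    """KL(target || induced) over trajectories (direction is an assumption)."""
    total = 0.0
    for traj, p in target.items():
        if p == 0.0:
            continue  # zero-probability targets contribute nothing
        q = induced.get(traj, 0.0)
        if q == 0.0:
            return math.inf  # the policy can never produce a desired trajectory
        total += p * math.log(p / q)
    return total


# Toy one-step problem: two actions with different outcome probabilities.
transition = {
    ("s0", "a"): {"s1": 0.9, "s2": 0.1},
    ("s0", "b"): {"s1": 0.2, "s2": 0.8},
}
policy = {"s0": {"a": 0.5, "b": 0.5}}
induced = induced_distribution(policy, transition, "s0", horizon=1)

matched_target = {("s0", "s1"): 0.55, ("s0", "s2"): 0.45}
uniform_target = {("s0", "s1"): 0.5, ("s0", "s2"): 0.5}
print(kl_divergence(matched_target, induced))
print(kl_divergence(uniform_target, induced))
```

In this sketch the divergence is close to zero when the policy's induced distribution matches the target and positive otherwise; a solver in the spirit of the abstract would search over policies to make this quantity as small as possible.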

Sooraj Bhat, David L. Roberts, Mark J. Nelson, Charles L. Isbell, Michael Mateas

ATAL 2007 | Markov Decision | Suboptimal Greedy Strategies | Trajectory Distribution Markov |

» Optimal algorithms for global optimization in case of unknown Lipschitz constant

» New Particle Swarm Optimization Algorithm Incorporating Reproduction Operator for Solving ...

» Optimal centers in branch-and-prune algorithms for univariate global optimization

» Metropolis Particle Swarm Optimization Algorithm with Mutation Operator for Global Optimiz...

» BBOB-benchmarking the DIRECT global optimization algorithm

» The differential Ant-Stigmergy Algorithm for large-scale global optimization

» GARS: an improved genetic algorithm with reserve selection for global optimization

» Large scale global optimization using self-adaptive differential evolution algorithm

» Gradient estimation in global optimization algorithms

Post Info

Added	07 Jun 2010
Updated	07 Jun 2010
Type	Conference
Year	2007
Where	ATAL
Authors	Sooraj Bhat, David L. Roberts, Mark J. Nelson, Charles L. Isbell, Michael Mateas
