Search Sciweavers | Sciweavers

268 search results - page 46 / 54

» Solving multiagent assignment Markov decision processes

220

Voted

ATAL
2010
Springer

128views Intelligent Agents» more ATAL 2010»

Approximate dynamic programming with affine ADDs

15 years 2 months ago

Download eprints.pascal-network.org

The Affine ADD (AADD) is an extension of the Algebraic Decision Diagram (ADD) that compactly represents context-specific, additive and multiplicative structure in functions from a...

Scott Sanner, William T. B. Uther, Karina Valdivia...

claim paper

Read More »

207

click to vote

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

16 years 1 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

185

Voted

JAIR
2006

120views more JAIR 2006»

FluCaP: A Heuristic Search Planner for First-Order MDPs

15 years 7 months ago

Download www.jair.org

We present a heuristic search algorithm for solving first-order Markov Decision Processes (FOMDPs). Our approach combines first-order state abstraction that avoids evaluating stat...

Steffen Hölldobler, Eldar Karabaev, Olga Skvo...

claim paper

Read More »

221

click to vote

JAIR
2010

115views more JAIR 2010»

An Investigation into Mathematical Programming for Finite Horizon Decentralized POMDPs

15 years 5 months ago

Download www.jair.org

Decentralized planning in uncertain environments is a complex task generally dealt with by using a decision-theoretic approach, mainly through the framework of Decentralized Parti...

Raghav Aras, Alain Dutech

claim paper

Read More »

227

Voted

ATAL
2011
Springer

208views Intelligent Agents» more ATAL 2011»

Incentive design for adaptive agents

14 years 7 months ago

Download www.eecs.harvard.edu

We consider a setting in which a principal seeks to induce an adaptive agent to select a target action by providing incentives on one or more actions. The agent maintains a belief...

Yiling Chen, Jerry Kung, David C. Parkes, Ariel D....

claim paper

Read More »

« Prev « First page 46 / 54 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers