Sciweavers

771 search results - page 40 / 155
» Markov Decision Processes with Arbitrary Reward Processes
IJCAI
2003
15 years 3 months ago
Taming Decentralized POMDPs: Towards Efficient Policy Computation for Multiagent Settings
The problem of deriving joint policies for a group of agents that maximize some joint reward function can be modeled as a decentralized partially observable Markov decision proces...
Ranjit Nair, Milind Tambe, Makoto Yokoo, David V. ...
QEST
2009
IEEE
15 years 8 months ago
Nondeterministic Labeled Markov Processes: Bisimulations and Logical Characterization
We extend the theory of labeled Markov processes with internal nondeterminism, a fundamental concept for the further development of a process theory with abstraction on nondetermi...
Pedro R. D'Argenio, Nicolás Wolovick, Pedro...
JDCTA
2010
14 years 8 months ago
Learning and Decision Making in Human During a Game of Matching Pennies
To gain insights into the neural basis of such adaptive decision-making processes, we investigated the nature of learning process in humans playing a competitive game with binary ...
Jianfeng Hu, Xiaofeng Li, Jinghai Yin
ICML
2006
IEEE
16 years 2 months ago
Fast direct policy evaluation using multiscale analysis of Markov diffusion processes
Policy evaluation is a critical step in the approximate solution of large Markov decision processes (MDPs), typically requiring O(|S|^3) to directly solve the Bellman system of |S...
Mauro Maggioni, Sridhar Mahadevan
ICML
2008
IEEE
16 years 2 months ago
Apprenticeship learning using linear programming
In apprenticeship learning, the goal is to learn a policy in a Markov decision process that is at least as good as a policy demonstrated by an expert. The difficulty arises in tha...
Umar Syed, Michael H. Bowling, Robert E. Schapire