Search Sciweavers | Sciweavers

771 search results - page 40 / 155

» Markov Decision Processes with Arbitrary Reward Processes

161

click to vote

IJCAI
2003

142views Artificial Intelligence» more IJCAI 2003»

Taming Decentralized POMDPs: Towards Efficient Policy Computation for Multiagent Settings

15 years 7 months ago

Download dli.iiit.ac.in

The problem of deriving joint policies for a group of agents that maximize some joint reward function can be modeled as a decentralized partially observable Markov decision proces...

Ranjit Nair, Milind Tambe, Makoto Yokoo, David V. ...

claim paper

Read More »

162

click to vote

QEST
2009
IEEE

144views Modeling and Simulation» more QEST 2009»

Nondeterministic Labeled Markov Processes: Bisimulations and Logical Characterization

16 years 9 days ago

Download www.cs.famaf.unc.edu.ar

We extend the theory of labeled Markov processes with internal nondeterminism, a fundamental concept for the further development of a process theory with abstraction on nondetermi...

Pedro R. D'Argenio, Nicolás Wolovick, Pedro...

claim paper

Read More »

172

click to vote

JDCTA
2010

160views more JDCTA 2010»

Learning and Decision Making in Human During a Game of Matching Pennies

15 years 11 days ago

Download www.aicit.org

To gain insights into the neural basis of such adaptive decision-making processes, we investigated the nature of learning process in humans playing a competitive game with binary ...

Jianfeng Hu, Xiaofeng Li, Jinghai Yin

claim paper

Read More »

167

click to vote

ICML
2006
IEEE

143views Machine Learning» more ICML 2006»

Fast direct policy evaluation using multiscale analysis of Markov diffusion processes

16 years 6 months ago

Download www.cs.umass.edu

Policy evaluation is a critical step in the approximate solution of large Markov decision processes (MDPs), typically requiring O(|S|3 ) to directly solve the Bellman system of |S...

Mauro Maggioni, Sridhar Mahadevan

claim paper

Read More »

132

click to vote

ICML
2008
IEEE

147views Machine Learning» more ICML 2008»

Apprenticeship learning using linear programming

16 years 6 months ago

Download www.cs.ualberta.ca

In apprenticeship learning, the goal is to learn a policy in a Markov decision process that is at least as good as a policy demonstrated by an expert. The difficulty arises in tha...

Umar Syed, Michael H. Bowling, Robert E. Schapire

claim paper

Read More »

« Prev « First page 40 / 155 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers