Search Sciweavers | Sciweavers

771 search results - page 3 / 155

» Markov Decision Processes with Arbitrary Reward Processes

click to vote

CORR
2011
Springer

183views Education» more CORR 2011»

Mean-Variance Optimization in Markov Decision Processes

13 years 18 days ago

Download web.mit.edu

We consider ﬁnite horizon Markov decision processes under performance measures that involve both the mean and the variance of the cumulative reward. We show that either randomiz...

Shie Mannor, John N. Tsitsiklis

claim paper

Read More »

click to vote

NIPS
2004

103views Information Technology» more NIPS 2004»

Experts in a Markov Decision Process

13 years 7 months ago

Download books.nips.cc

We consider an MDP setting in which the reward function is allowed to change during each time step of play (possibly in an adversarial manner), yet the dynamics remain fixed. Simi...

Eyal Even-Dar, Sham M. Kakade, Yishay Mansour

claim paper

Read More »

click to vote

ECML
2005
Springer

143views Machine Learning» more ECML 2005»

Active Learning in Partially Observable Markov Decision Processes

13 years 11 months ago

Download www.cs.mcgill.ca

This paper examines the problem of ﬁnding an optimal policy for a Partially Observable Markov Decision Process (POMDP) when the model is not known or is only poorly speciﬁed. W...

Robin Jaulmes, Joelle Pineau, Doina Precup

claim paper

Read More »

click to vote

COLING
2010

138views Computational Linguistics» more COLING 2010»

Controlling Listening-oriented Dialogue using Partially Observable Markov Decision Processes

13 years 17 days ago

Download aclweb.org

This paper investigates how to automatically create a dialogue control component of a listening agent to reduce the current high cost of manually creating such components. We coll...

Toyomi Meguro, Ryuichiro Higashinaka, Yasuhiro Min...

claim paper

Read More »

click to vote

UAI
2003

87views Artificial Intelligence» more UAI 2003»

Implementation and Comparison of Solution Methods for Decision Processes with Non-Markovian Rewards

13 years 7 months ago

Download users.cecs.anu.edu.au

This paper examines a number of solution methods for decision processes with non-Markovian rewards (NMRDPs). They all exploit a temporal logic speciﬁcation of the reward functio...

Charles Gretton, David Price, Sylvie Thiéba...

claim paper

Read More »

« Prev « First page 3 / 155 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers