Search Sciweavers | Sciweavers

771 search results - page 20 / 155

» Markov Decision Processes with Arbitrary Reward Processes

100

click to vote

ALT
2008
Springer

141views Machine Learning» more ALT 2008»

Online Regret Bounds for Markov Decision Processes with Deterministic Transitions

15 years 10 months ago

Download personal.unileoben.ac.at

Abstract. We consider an upper conﬁdence bound algorithm for Markov decision processes (MDPs) with deterministic transitions. For this algorithm we derive upper bounds on the onl...

Ronald Ortner

claim paper

Read More »

123

click to vote

QEST
2008
IEEE

146views Modeling and Simulation» more QEST 2008»

Symbolic Magnifying Lens Abstraction in Markov Decision Processes

15 years 8 months ago

Download www.ee.ucla.edu

Magnifying Lens Abstraction in Markov Decision Processes ∗ Pritam Roy1 David Parker2 Gethin Norman2 Luca de Alfaro1 Computer Engineering Dept, UC Santa Cruz, Santa Cruz, CA, USA ...

Pritam Roy, David Parker, Gethin Norman, Luca de A...

claim paper

Read More »

130

Voted

ECAI
2008
Springer

158views Artificial Intelligence» more ECAI 2008»

A Simulation-based Approach for Solving Generalized Semi-Markov Decision Processes

15 years 3 months ago

Download emmanuel.rachelson.free.fr

Time is a crucial variable in planning and often requires special attention since it introduces a specific structure along with additional complexity, especially in the case of dec...

Emmanuel Rachelson, Gauthier Quesnel, Fréd&...

claim paper

Read More »

109

Voted

DA
2010

139views more DA 2010»

Eliciting Patients' Revealed Preferences: An Inverse Markov Decision Process Approach

14 years 11 months ago

Download www.ie.pitt.edu

. Direct approaches, which involve asking patients various abstract questions, have significant drawbacks. We propose a new approach that infers patient preferences based on observ...

Zeynep Erkin, Matthew D. Bailey, Lisa M. Maillart,...

claim paper

Read More »

106

click to vote

DSN
2006
IEEE

151views Computer Networks» more DSN 2006»

Automatic Recovery Using Bounded Partially Observable Markov Decision Processes

15 years 7 months ago

Download www.perform.csl.illinois.edu

This paper provides a technique, based on partially observable Markov decision processes (POMDPs), for building automatic recovery controllers to guide distributed system recovery...

Kaustubh R. Joshi, William H. Sanders, Matti A. Hi...

claim paper

Read More »

« Prev « First page 20 / 155 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers