Search Sciweavers | Sciweavers

682 search results - page 13 / 137

» One-Counter Markov Decision Processes

176

click to vote

UAI
1998

91views Artificial Intelligence» more UAI 1998»

Hierarchical Solution of Markov Decision Processes using Macro-actions

15 years 7 months ago

Download www.cs.toronto.edu

tigate the use of temporally abstract actions, or macro-actions, in the solution of Markov decision processes. Unlike current models that combine both primitive actions and macro-...

Milos Hauskrecht, Nicolas Meuleau, Leslie Pack Kae...

claim paper

Read More »

196

click to vote

CSL
2007
Springer

126views Automated Reasoning» more CSL 2007»

Partially observable Markov decision processes for spoken dialog systems

15 years 6 months ago

Download mi.eng.cam.ac.uk

In a spoken dialog system, determining which action a machine should take in a given situation is a diﬃcult problem because automatic speech recognition is unreliable and hence ...

Jason D. Williams, Steve Young

claim paper

Read More »

143

click to vote

ALT
2008
Springer

141views Machine Learning» more ALT 2008»

Online Regret Bounds for Markov Decision Processes with Deterministic Transitions

16 years 2 months ago

Download personal.unileoben.ac.at

Abstract. We consider an upper conﬁdence bound algorithm for Markov decision processes (MDPs) with deterministic transitions. For this algorithm we derive upper bounds on the onl...

Ronald Ortner

claim paper

Read More »

168

click to vote

ECAI
2008
Springer

158views Artificial Intelligence» more ECAI 2008»

A Simulation-based Approach for Solving Generalized Semi-Markov Decision Processes

15 years 7 months ago

Download emmanuel.rachelson.free.fr

Time is a crucial variable in planning and often requires special attention since it introduces a specific structure along with additional complexity, especially in the case of dec...

Emmanuel Rachelson, Gauthier Quesnel, Fréd&...

claim paper

Read More »

168

click to vote

IJCAI
2007

201views Artificial Intelligence» more IJCAI 2007»

Using Linear Programming for Bayesian Exploration in Markov Decision Processes

15 years 7 months ago

Download www.cs.mcgill.ca

A key problem in reinforcement learning is ﬁnding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

« Prev « First page 13 / 137 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers