Search Sciweavers | Sciweavers

1138 search results - page 101 / 228

» Feature Markov Decision Processes

118

Voted

NAACL
2007

125views Computational Linguistics» more NAACL 2007»

Comparing User Simulation Models For Dialog Strategy Learning

15 years 6 months ago

Download www.cs.pitt.edu

This paper explores what kind of user simulation model is suitable for developing a training corpus for using Markov Decision Processes (MDPs) to automatically learn dialog strate...

Hua Ai, Joel R. Tetreault, Diane J. Litman

claim paper

Read More »

144

click to vote

NIPS
2007

149views Information Technology» more NIPS 2007»

Online Linear Regression and Its Application to Model-Based Reinforcement Learning

15 years 6 months ago

Download books.nips.cc

We provide a provably efﬁcient algorithm for learning Markov Decision Processes (MDPs) with continuous state and action spaces in the online setting. Speciﬁcally, we take a mo...

Alexander L. Strehl, Michael L. Littman

claim paper

Read More »

121

Voted

UAI
2000

102views Artificial Intelligence» more UAI 2000»

Approximately Optimal Monitoring of Plan Preconditions

15 years 6 months ago

Download www.cs.toronto.edu

Monitoring plan preconditions can allow for replanning when a precondition fails, generally far in advance of the point in the plan where the precondition is relevant. However, mo...

Craig Boutilier

claim paper

Read More »

175

click to vote

UAI
2007

173views Artificial Intelligence» more UAI 2007»

Automatic Generation of Four-part Harmony

15 years 6 months ago

Download sunsite.informatik.rwth-aachen.de

This paper introduces decision-theoretic planning techniques into automatic music generation. Markov decision processes (MDPs) are a mathematical model of planning under uncertain...

Liangrong Yi, Judy Goldsmith

claim paper

Read More »

132

click to vote

ML
2002
ACM

146views Machine Learning» more ML 2002»

Variable Resolution Discretization in Optimal Control

15 years 4 months ago

Download www.ri.cmu.edu

Abstract. The problemof state abstractionis of centralimportancein optimalcontrol,reinforcement learning and Markov decision processes. This paper studies the case of variable reso...

Rémi Munos, Andrew W. Moore

claim paper

Read More »

« Prev « First page 101 / 228 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers