Search Sciweavers | Sciweavers

166 search results - page 24 / 34

» Online model learning in adversarial Markov decision process...

110

click to vote

ICASSP
2008
IEEE

215views Signal Processing» more ICASSP 2008»

Bayesian update of dialogue state for robust dialogue systems

15 years 6 months ago

Download mi.eng.cam.ac.uk

This paper presents a new framework for accumulating beliefs in spoken dialogue systems. The technique is based on updating a Bayesian Network that represents the underlying state...

Blaise Thomson, Jost Schatzmann, Steve Young

claim paper

Read More »

click to vote

NIPS
2001

192views Information Technology» more NIPS 2001»

Predictive Representations of State

15 years 1 months ago

Download www.eecs.umich.edu

We show that states of a dynamical system can be usefully represented by multi-step, action-conditional predictions of future observations. State representations that are grounded...

Michael L. Littman, Richard S. Sutton, Satinder P....

claim paper

Read More »

click to vote

ICML
2003
IEEE

124views Machine Learning» more ICML 2003»

Exploration in Metric State Spaces

16 years 13 days ago

Download www.cis.upenn.edu

We present metric?? , a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows t...

Sham Kakade, Michael J. Kearns, John Langford

claim paper

Read More »

click to vote

AAAI
2008

151views Intelligent Agents» more AAAI 2008»

Another Look at Search-Based Drama Management

15 years 2 months ago

Download www.aaai.org

A drama manager (DM) monitors an interactive experience, such as a computer game, and intervenes to shape the global experience so it satisfies the author's expressive goals ...

Mark J. Nelson, Michael Mateas

claim paper

Read More »

click to vote

AAAI
2006

142views Intelligent Agents» more AAAI 2006»

Learning Basis Functions in Hybrid Domains

15 years 1 months ago

Download www.aaai.org

Markov decision processes (MDPs) with discrete and continuous state and action components can be solved efficiently by hybrid approximate linear programming (HALP). The main idea ...

Branislav Kveton, Milos Hauskrecht

claim paper

Read More »

« Prev « First page 24 / 34 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers