Search Sciweavers | Sciweavers

40 search results - page 7 / 8

» Learning Partially Observable Action Schemas


ICASSP
2008
IEEE

215 views, Signal Processing

Bayesian update of dialogue state for robust dialogue systems

13 years 11 months ago

Download mi.eng.cam.ac.uk

This paper presents a new framework for accumulating beliefs in spoken dialogue systems. The technique is based on updating a Bayesian Network that represents the underlying state...

Blaise Thomson, Jost Schatzmann, Steve Young


NIPS
2001

192 views, Information Technology

Predictive Representations of State

13 years 6 months ago

Download www.eecs.umich.edu

We show that states of a dynamical system can be usefully represented by multi-step, action-conditional predictions of future observations. State representations that are grounded...

Michael L. Littman, Richard S. Sutton, Satinder P....


COLT
2005
Springer

128 views, Machine Learning

From External to Internal Regret

13 years 7 months ago

Download www.cs.cmu.edu

External regret compares the performance of an online algorithm, selecting among N actions, to the performance of the best of those actions in hindsight. Internal regret compares ...

Avrim Blum, Yishay Mansour


AI
2006
Springer

160 views, Artificial Intelligence

Satisfaction Equilibrium: Achieving Cooperation in Incomplete Information Games

13 years 9 months ago

Download www.cs.cmu.edu

So far, most equilibrium concepts in game theory require that the rewards and actions of the other agents are known and/or observed by all agents. However, in real life problems, a...

Stéphane Ross, Brahim Chaib-draa


NIPS
2007

207 views, Information Technology

Bayes-Adaptive POMDPs

13 years 6 months ago

Download books.nips.cc

Bayesian Reinforcement Learning has generated substantial interest recently, as it provides an elegant solution to the exploration-exploitation trade-off in reinforcement learning...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...
