Search Sciweavers | Sciweavers

» Communication for Improving Policy Computation in Distribute...

ICML
1994
IEEE

151 views, Machine Learning

Learning Without State-Estimation in Partially Observable Markovian Decision Processes

15 years 9 months ago

Download www.eecs.umich.edu

Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning control architectures for embedded agents. Unfortunately all of the theory and much ...

Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...

SIGDIAL
2010

98 views, Natural Language Processing

Investigating Clarification Strategies in a Hybrid POMDP Dialog Manager

15 years 4 months ago

Download www.sigdial.org

We investigate the clarification strategies exhibited by a hybrid POMDP dialog manager based on data obtained from a phone-based user study. The dialog manager combines task struc...

Sebastian Varges, Silvia Quarteroni, Giuseppe Ricc...

CSL
2010
Springer

238 views, Automated Reasoning

Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems

15 years 6 months ago

Download mi.eng.cam.ac.uk

This paper describes a statistically motivated framework for performing real-time dialogue state updates and policy learning in a spoken dialogue system. The framework is based on...

Blaise Thomson, Steve Young

ATAL
2008
Springer

105 views, Intelligent Agents

Value-based observation compression for DEC-POMDPs

15 years 8 months ago

Download www.ifaamas.org

Representing agent policies compactly is essential for improving the scalability of multi-agent planning algorithms. In this paper, we focus on developing a pruning technique that...

Alan Carlin, Shlomo Zilberstein

IEEEPACT
2008
IEEE

136 views, Distributed And Parallel Com...

Feature selection and policy optimization for distributed instruction placement using reinforcement learning

16 years 15 days ago

Download userweb.cs.utexas.edu

Communication overheads are one of the fundamental challenges in a multiprocessor system. As the number of processors on a chip increases, communication overheads and the distribu...

Katherine E. Coons, Behnam Robatmili, Matthew E. T...
