Search Sciweavers | Sciweavers

106 search results - page 14 / 22

» Performance Bounded Reinforcement Learning in Strategic Inte...

ACL
2010

176 views, Computational Linguistics

Learning to Adapt to Unknown Users: Referring Expression Generation in Spoken Dialogue Systems

15 years 5 months ago

Download aclweb.org

We present a data-driven approach to learn user-adaptive referring expression generation (REG) policies for spoken dialogue systems. Referring expressions can be difficult to unde...

Srinivasan Janarthanam, Oliver Lemon

ATAL
2008
Springer

128 views, Intelligent Agents

Simultaneously modeling humans' preferences and their beliefs about others' preferences

15 years 9 months ago

Download www.eecs.harvard.edu

In strategic multiagent decision making, it is often the case that a strategic reasoner must hold beliefs about other agents and use these beliefs to inform its decision making. T...

Sevan G. Ficici, Avi Pfeffer

ATAL
2004
Springer

149 views, Intelligent Agents

Learning User Preferences for Wireless Services Provisioning

16 years 1 month ago

Download people.csail.mit.edu

The problem of interest is how to dynamically allocate wireless access services in a competitive market which implements a take-it-or-leave-it allocation mechanism. In this paper ...

George Lee, Steven Bauer, Peyman Faratin, John Wro...

NIPS
1993

123 views, Information Technology

Temporal Difference Learning of Position Evaluation in the Game of Go

15 years 9 months ago

Download www.gatsby.ucl.ac.uk

The game of Go has a high branching factor that defeats the tree search approach used in computer chess, and long-range spatiotemporal interactions that make position evaluation e...

Nicol N. Schraudolph, Peter Dayan, Terrence J. Sej...

ICML
2009
IEEE

155 views, Machine Learning

Near-Bayesian exploration in polynomial time

16 years 8 months ago

Download ai.stanford.edu

We consider the exploration/exploitation problem in reinforcement learning (RL). The Bayesian approach to model-based RL offers an elegant solution to this problem, by considering...

J. Zico Kolter, Andrew Y. Ng
