Search Sciweavers | Sciweavers

1863 search results - page 135 / 373

» Multiagent learning using a variable learning rate

192

click to vote

ATAL
2006
Springer

131views Intelligent Agents» more ATAL 2006»

Learning the task allocation game

15 years 10 months ago

Download dis.cs.umass.edu

The distributed task allocation problem occurs in domains like web services, the grid, and other distributed systems. In this problem, the system consists of servers and mediators...

Sherief Abdallah, Victor R. Lesser

claim paper

Read More »

180

click to vote

NECO
2010

97views more NECO 2010»

Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning

15 years 5 months ago

Download www.kyb.tuebingen.mpg.de

Most conventional Policy Gradient Reinforcement Learning (PGRL) algorithms neglect (or do not explicitly make use of) a term in the average reward gradient with respect to the pol...

Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto...

claim paper

Read More »

198

click to vote

UAI
2000

116views Artificial Intelligence» more UAI 2000»

Gaussian Process Networks

15 years 8 months ago

Download robotics.stanford.edu

In this paper we address the problem of learning the structure of a Bayesian network in domains with continuous variables. This task requires a procedure for comparing different c...

Nir Friedman, Iftach Nachman

claim paper

Read More »

180

click to vote

ICML
2003
IEEE

129views Machine Learning» more ICML 2003»

Learning on the Test Data: Leveraging Unseen Features

16 years 7 months ago

Download www.cis.upenn.edu

This paper addresses the problem of classification in situations where the data distribution is not homogeneous: Data instances might come from different locations or times, and t...

Benjamin Taskar, Ming Fai Wong, Daphne Koller

claim paper

Read More »

211

click to vote

SIGDIAL
2010

186views Natural Language Processing» more SIGDIAL 2010»

Adaptive Referring Expression Generation in Spoken Dialogue Systems: Evaluation with Real Users

15 years 4 months ago

Download www.sigdial.org

We present new results from a real-user evaluation of a data-driven approach to learning user-adaptive referring expression generation (REG) policies for spoken dialogue systems. ...

Srinivasan Janarthanam, Oliver Lemon

claim paper

Read More »

« Prev « First page 135 / 373 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers