Search Sciweavers | Sciweavers

32 search results - page 6 / 7

» Batch Reinforcement Learning with State Importance

IJCNN
2008
IEEE

Neural Networks

Learning to select relevant perspective in a dynamic environment

15 years 6 months ago

Download www.cs.qub.ac.uk

When an agent observes its environment, there are two important characteristics of the perceived information. One is the relevance of information and the other is redundancy. T...

Zhihui Luo, David A. Bell, Barry McCollum, Qingxia...

ICML
1996
IEEE

Machine Learning

Learning Evaluation Functions for Large Acyclic Domains

16 years 13 days ago

Download www.ri.cmu.edu

Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...

Justin A. Boyan, Andrew W. Moore

ICASSP
2008
IEEE

Signal Processing

Using dialogue acts to learn better repair strategies for spoken dialogue systems

15 years 6 months ago

Download www.stanford.edu

Repair or error-recovery strategies are an important design issue in Spoken Dialogue Systems (SDSs) - how to conduct the dialogue when there is no progress (e.g. due to repeated A...

Matthew Frampton, Oliver Lemon

ECML
2007
Springer

Machine Learning

Policy Gradient Critics

15 years 5 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

AAMAS
2010
Springer

Intelligent Agents

Coordinated learning in multiagent MDPs with infinite state-space

14 years 11 months ago

Download gaips.inesc-id.pt

In this paper we address the problem of simultaneous learning and coordination in multiagent Markov decision problems (MMDPs) with infinite state-spaces. We separate this ...

Francisco S. Melo, M. Isabel Ribeiro
