Search Sciweavers | Sciweavers

548 search results - page 14 / 110

» A New Way to Introduce Knowledge into Reinforcement Learning

AAAI
2006

190views Intelligent Agents» more AAAI 2006»

Action Selection in Bayesian Reinforcement Learning

15 years 3 months ago

Download www.aaai.org

My research attempts to address on-line action selection in reinforcement learning from a Bayesian perspective. The idea is to develop more effective action selection techniques b...

Tao Wang

ECML
2007
Springer

170views Machine Learning» more ECML 2007»

Sequence Labeling with Reinforcement Learning and Ranking Algorithms

15 years 3 months ago

Download nieme.lip6.fr

Many problems in areas such as Natural Language Processing, Information Retrieval, or Bioinformatics involve the generic task of sequence labeling. In many cases, the aim is to assi...

Francis Maes, Ludovic Denoyer, Patrick Gallinari

ICML
2010
IEEE

171views Machine Learning» more ICML 2010»

Efficient Reinforcement Learning with Multiple Reward Functions for Randomized Controlled Trial Analysis

15 years 2 months ago

Download www.stat.lsa.umich.edu

We introduce new, efficient algorithms for value iteration with multiple reward functions and continuous state. We also give an algorithm for finding the set of all nondominated a...

Daniel J. Lizotte, Michael H. Bowling, Susan A. Mu...

ATAL
2006
Springer

142views Intelligent Agents» more ATAL 2006»

Probabilistic policy reuse in a reinforcement learning agent

15 years 5 months ago

Download www.cs.cmu.edu

We contribute Policy Reuse as a technique to improve a reinforcement learning agent with guidance from past learned similar policies. Our method relies on using the past policies ...

Fernando Fernández, Manuela M. Veloso

ISDA
2009
IEEE

144views Operating System» more ISDA 2009»

Postponed Updates for Temporal-Difference Reinforcement Learning

15 years 8 months ago

Download www.science.uva.nl

This paper presents postponed updates, a new strategy for TD methods that can improve sample efficiency without incurring the computational and space requirements of model-based ...

Harm van Seijen, Shimon Whiteson
