Search Sciweavers | Sciweavers

397 search results - page 16 / 80

» Reinforcement Learning with Hierarchies of Machines

click to vote

ECML
2006
Springer

88views Machine Learning» more ECML 2006»

Reinforcement Learning for MDPs with Constraints

14 years 11 months ago

Download www.peter-geibel.de

In this article, I will consider Markov Decision Processes with two criteria, each defined as the expected value of an infinite horizon cumulative return. The second criterion is e...

Peter Geibel

claim paper

Read More »

click to vote

NIPS
2004

137views Information Technology» more NIPS 2004»

Brain Inspired Reinforcement Learning

14 years 11 months ago

Download books.nips.cc

Successful application of reinforcement learning algorithms often involves considerable hand-crafting of the necessary non-linear features to reduce the complexity of the value fu...

François Rivest, Yoshua Bengio, John Kalask...

claim paper

Read More »

Voted

ICML
2009
IEEE

172views Machine Learning» more ICML 2009»

Model-free reinforcement learning as mixture learning

15 years 10 months ago

Download user.cs.tu-berlin.de

We cast model-free reinforcement learning as the problem of maximizing the likelihood of a probabilistic mixture model via sampling, addressing both the infinite and finite horizo...

Nikos Vlassis, Marc Toussaint

claim paper

Read More »

click to vote

ICML
2009
IEEE

160views Machine Learning» more ICML 2009»

The adaptive k-meteorologists problem and its application to structure learning and feature selection in reinforcement learning

15 years 10 months ago

Download www.research.rutgers.edu

The purpose of this paper is three-fold. First, we formalize and study a problem of learning probabilistic concepts in the recently proposed KWIK framework. We give details of an ...

Carlos Diuk, Lihong Li, Bethany R. Leffler

claim paper

Read More »

click to vote

AIMSA
2006
Springer

159views Artificial Intelligence» more AIMSA 2006»

Machine Learning for Spoken Dialogue Management: An Experiment with Speech-Based Database Querying

15 years 1 months ago

Download tcts.fpms.ac.be

Although speech and language processing techniques achieved a relative maturity during the last decade, designing a spoken dialogue system is still a tailoring task because of the ...

Olivier Pietquin

claim paper

Read More »

« Prev « First page 16 / 80 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers