Search Sciweavers | Sciweavers

651 search results - page 89 / 131

» Algorithms for Inverse Reinforcement Learning

108

click to vote

FLAIRS
2008

132views Artificial Intelligence» more FLAIRS 2008»

Learning Continuous Action Models in a Real-Time Strategy Environment

15 years 2 months ago

Download www.knexusresearch.com

Although several researchers have integrated methods for reinforcement learning (RL) with case-based reasoning (CBR) to model continuous action spaces, existing integrations typic...

Matthew Molineaux, David W. Aha, Philip Moore

claim paper

Read More »

103

click to vote

ICML
2009
IEEE

186views Machine Learning» more ICML 2009»

Regularization and feature selection in least-squares temporal difference learning

16 years 20 days ago

Download ai.stanford.edu

We consider the task of reinforcement learning with linear value function approximation. Temporal difference algorithms, and in particular the Least-Squares Temporal Difference (L...

J. Zico Kolter, Andrew Y. Ng

claim paper

Read More »

click to vote

ICML
2001
IEEE

132views Machine Learning» more ICML 2001»

Expectation Maximization for Weakly Labeled Data

16 years 20 days ago

Download characters.media.mit.edu

We call data weakly labeled if it has no exact label but rather a numerical indication of correctness of the label "guessed" by the learning algorithm - a situation comm...

Yuri A. Ivanov, Bruce Blumberg, Alex Pentland

claim paper

Read More »

click to vote

IDEAL
2000
Springer

105views Intelligent Agents» more IDEAL 2000»

Observational Learning with Modular Networks

15 years 3 months ago

Download dmlab.snu.ac.kr

Observational learning algorithm is an ensemble algorithm where each network is initially trained with a bootstrapped data set and virtual data are generated from the ensemble for ...

Hyunjung Shin, Hyoungjoo Lee, Sungzoon Cho

claim paper

Read More »

click to vote

NIPS
2003

108views Information Technology» more NIPS 2003»

Policy Search by Dynamic Programming

15 years 1 months ago

Download books.nips.cc

We consider the policy search approach to reinforcement learning. We show that if a “baseline distribution” is given (indicating roughly how often we expect a good policy to v...

J. Andrew Bagnell, Sham Kakade, Andrew Y. Ng, Jeff...

claim paper

Read More »

« Prev « First page 89 / 131 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers