Search Sciweavers | Sciweavers

81 search results - page 10 / 17

» Neuroevolutionary reinforcement learning for generalized hel...

click to vote

FLAIRS
2006

109views Artificial Intelligence» more FLAIRS 2006»

Refining Human Behavior Models in a Context-based Architecture

15 years 1 months ago

Download www.aaai.org

This paper describes an investigation into the refinement of context-based human behavior models through the use of experiential learning. Specifically, a tactical agent was endow...

David Aihe, Avelino J. Gonzalez

claim paper

Read More »

click to vote

ATAL
2008
Springer

123views Intelligent Agents» more ATAL 2008»

Sigma point policy iteration

15 years 1 months ago

Download web.mit.edu

In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...

Michael H. Bowling, Alborz Geramifard, David Winga...

claim paper

Read More »

click to vote

ICML
2008
IEEE

133views Machine Learning» more ICML 2008»

Space-indexed dynamic programming: learning to follow trajectories

16 years 15 days ago

Download www.cs.stanford.edu

We consider the task of learning to accurately follow a trajectory in a vehicle such as a car or helicopter. A number of dynamic programming algorithms such as Differential Dynami...

J. Zico Kolter, Adam Coates, Andrew Y. Ng, Yi Gu, ...

claim paper

Read More »

123

click to vote

JAIR
2007

124views more JAIR 2007»

Closed-Loop Learning of Visual Control Policies

14 years 11 months ago

Download www.jair.org

In this paper we present a general, ﬂexible framework for learning mappings from images to actions by interacting with the environment. The basic idea is to introduce a feature-...

Sébastien Jodogne, Justus H. Piater

claim paper

Read More »

111

click to vote

FLAIRS
2009

135views Artificial Intelligence» more FLAIRS 2009»

Beating the Defense: Using Plan Recognition to Inform Learning Agents

14 years 9 months ago

Download www.knexusresearch.com

In this paper, we investigate the hypothesis that plan recognition can significantly improve the performance of a casebased reinforcement learner in an adversarial action selectio...

Matthew Molineaux, David W. Aha, Gita Sukthankar

claim paper

Read More »

« Prev « First page 10 / 17 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers