Sciweavers

132 search results - page 21 / 27
» Rewarding Behaviors
Sort
View
AAAI
2000
15 years 3 months ago
Interactive Training for Synthetic Characters
Compelling synthetic characters must behave in ways that reflect their past experience and thus allow for individual personalization. We therefore need a method that allows charac...
Song-Yee Yoon, Robert C. Burke, Bruce Blumberg, Ge...
AGI
2011
14 years 5 months ago
Learning Problem Solving Skills from Demonstration: An Architectural Approach
We present an architectural approach to learning problem solving skills from demonstration, using internal models to represent problem-solving operational knowledge. Internal forwa...
Haris Dindo, Antonio Chella, Giuseppe La Tona, Mon...
117
Voted
ICML
2006
IEEE
16 years 2 months ago
Probabilistic inference for solving discrete and continuous state Markov Decision Processes
Inference in Markov Decision Processes has recently received interest as a means to infer goals of an observed action, policy recognition, and also as a tool to compute policies. ...
Marc Toussaint, Amos J. Storkey
ICML
2005
IEEE
16 years 2 months ago
Proto-value functions: developmental reinforcement learning
This paper presents a novel framework called proto-reinforcement learning (PRL), based on a mathematical model of a proto-value function: these are task-independent basis function...
Sridhar Mahadevan
140
Voted
ICML
1995
IEEE
16 years 2 months ago
Learning Policies for Partially Observable Environments: Scaling Up
Partially observable Markov decision processes (pomdp's) model decision problems in which an agent tries to maximize its reward in the face of limited and/or noisy sensor fee...
Michael L. Littman, Anthony R. Cassandra, Leslie P...