Sciweavers

48 search results - page 10 / 10
» Note on MAX 2SAT
AR
2006
13 years 5 months ago
Adaptive body schema for robotic tool-use
The development and expression of many higher level cognitive functions, such as imitation, spatial perception, and tool-use, rely on a multi-modal representation of the body kno...
Cota Nabeshima, Yasuo Kuniyoshi, Max Lungarella
ECML
2004
Springer
13 years 11 months ago
Batch Reinforcement Learning with State Importance
Abstract. We investigate the problem of using function approximation in reinforcement learning where the agent’s policy is represented as a classifier mapping states to actions....
Lihong Li, Vadim Bulitko, Russell Greiner
ICML
1996
IEEE
14 years 6 months ago
Learning Evaluation Functions for Large Acyclic Domains
Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...
Justin A. Boyan, Andrew W. Moore