Sciweavers

48 search results - page 10 / 10
» Note on MAX 2SAT
Sort
View
67
Voted
AR
2006
95views more  AR 2006»
14 years 9 months ago
Adaptive body schema for robotic tool-use
The development and expression of many higher level cognitive functions, such as imitation, spatial perception, and tool-use relies on a multi-modal representation of the body kno...
Cota Nabeshima, Yasuo Kuniyoshi, Max Lungarella
ECML
2004
Springer
15 years 2 months ago
Batch Reinforcement Learning with State Importance
Abstract. We investigate the problem of using function approximation in reinforcement learning where the agent’s policy is represented as a classifier mapping states to actions....
Lihong Li, Vadim Bulitko, Russell Greiner
96
Voted
ICML
1996
IEEE
15 years 10 months ago
Learning Evaluation Functions for Large Acyclic Domains
Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...
Justin A. Boyan, Andrew W. Moore