Sciweavers

48 search results - page 10 / 10
» Note on MAX 2SAT
AR
2006
14 years 11 months ago
Adaptive body schema for robotic tool-use
The development and expression of many higher level cognitive functions, such as imitation, spatial perception, and tool-use, rely on a multi-modal representation of the body kno...
Cota Nabeshima, Yasuo Kuniyoshi, Max Lungarella
ECML
2004
Springer
15 years 5 months ago
Batch Reinforcement Learning with State Importance
We investigate the problem of using function approximation in reinforcement learning where the agent’s policy is represented as a classifier mapping states to actions....
Lihong Li, Vadim Bulitko, Russell Greiner
ICML
1996
IEEE
16 years 14 days ago
Learning Evaluation Functions for Large Acyclic Domains
Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...
Justin A. Boyan, Andrew W. Moore