Sciweavers

335 search results - page 47 / 67
» Learning Simulation Control in General Game-Playing Agents
Sort
View
ATAL
2009
Springer
15 years 4 months ago
Online exploration in least-squares policy iteration
One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...
Lihong Li, Michael L. Littman, Christopher R. Mans...
CDC
2009
IEEE
139views Control Systems» more  CDC 2009»
15 years 2 months ago
A bio-plausible design for visual attitude stabilization
— We consider the problem of attitude stabilization using exclusively visual sensory input, and we look for a solution which can satisfy the constraints of a “bio-plausible” ...
Andrea Censi, Shuo Han, Sawyer B. Fuller, Richard ...
AAAI
1994
14 years 11 months ago
GENET: A Connectionist Architecture for Solving Constraint Satisfaction Problems by Iterative Improvement
New approaches to solving constraint satisfaction problems using iterative improvement techniques have been found to be successful on certain, very large problems such as the mill...
Andrew J. Davenport, Edward P. K. Tsang, Chang J. ...
IROS
2006
IEEE
168views Robotics» more  IROS 2006»
15 years 3 months ago
Learning to Drive Among Obstacles
— This paper reports on an outdoor mobile robot that learns to avoid collisions by observing a human driver operate a vehicle equipped with sensors that continuously produce a ma...
Bradley Hamner, Sebastian Scherer, Sanjiv Singh
ITS
1992
Springer
152views Multimedia» more  ITS 1992»
15 years 1 months ago
People Power: A Human-Computer Collaborative Learning System
Abstract. This paper reports our research work in the new field of humancomputer collaborative learning (HCCL). The general architecture of an HCCL is defined. An HCCL system, call...
Pierre Dillenbourg, John A. Self