Sciweavers

274 search results - page 44 / 55
» Network reinforcement
Sort
View
AI50
2006
15 years 1 months ago
Adaptive Multi-modal Sensors
Compressing real-time input through bandwidth constrained connections has been studied within robotics, wireless sensor networks, and image processing. When there are bandwidth con...
Kyle Ira Harrington, Hava T. Siegelmann
NIPS
2008
14 years 11 months ago
Temporal Difference Based Actor Critic Learning - Convergence and Neural Implementation
Actor-critic algorithms for reinforcement learning are achieving renewed popularity due to their good convergence properties in situations where other approaches often fail (e.g.,...
Dotan Di Castro, Dmitry Volkinshtein, Ron Meir
BCEC
1997
14 years 11 months ago
Adaptive Task Allocation Inspired by a Model of Division of Labor in Social Insects
Social insects provide us with a powerful metaphor to create decentralized systems of simple interacting, and often mobile, agents. The emergent collective intelligence of social i...
Eric Bonabeau, Andrej Sobkowski, Guy Theraulaz, Je...
72
Voted
NIPS
1997
14 years 11 months ago
Generalized Prioritized Sweeping
Prioritized sweeping is a model-based reinforcement learning method that attempts to focus an agent’s limited computational resources to achieve a good estimate of the value of ...
David Andre, Nir Friedman, Ronald Parr
AROBOTS
1998
111views more  AROBOTS 1998»
14 years 9 months ago
Emergence and Categorization of Coordinated Visual Behavior Through Embodied Interaction
This paper discusses the emergence of sensorimotor coordination for ESCHeR, a 4DOF redundant foveated robot-head, by interaction with its environment. A feedback-error-learning(FEL...
Luc Berthouze, Yasuo Kuniyoshi