Sciweavers

121 search results - page 16 / 25
» Learning Decision Theoretic Utilities through Reinforcement ...
ECCV
2010
Springer
15 years 1 month ago
Discriminative Tracking by Metric Learning
We present a discriminative model that casts appearance modeling and visual matching into a single objective for visual tracking. Most previous discriminative models for visual tra...
AAMAS
2005
Springer
14 years 9 months ago
Cooperative Multi-Agent Learning: The State of the Art
Cooperative multi-agent systems are ones in which several agents attempt, through their interaction, to jointly solve tasks or to maximize utility. Due to the interactions among t...
Liviu Panait, Sean Luke
ECAI
2000
Springer
15 years 1 month ago
Learning to Use Operational Advice
We address the problem of advice-taking in a given domain, in particular for building a game-playing program. Our approach to solving it strives for the application of machine lea...
Johannes Fürnkranz, Bernhard Pfahringer, Herm...
ECML
2007
Springer
15 years 3 months ago
Policy Gradient Critics
We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...
Daan Wierstra, Jürgen Schmidhuber
AR
2002
14 years 9 months ago
Acquiring state from control dynamics to learn grasping policies for robot hands
A prominent emerging theory of sensorimotor development in biological systems proposes that control knowledge is encoded in the dynamics of physical interaction with the ...
Roderic A. Grupen, Jefferson A. Coelho Jr.