Sciweavers

1118 search results - page 90 / 224
» Relational temporal difference learning
NECO
2007
15 years 29 days ago
Reinforcement Learning Through Modulation of Spike-Timing-Dependent Synaptic Plasticity
The persistent modification of synaptic efficacy as a function of the relative timing of pre- and postsynaptic spikes is a phenomenon known as spike-timing-dependent plasticity (...
Razvan V. Florian
CVIU
2006
15 years 1 months ago
Matching actions in presence of camera motion
When the camera viewing an action is moving, the motion observed in the video not only contains the motion of the actor but also the motion of the camera. At each time instant, in...
Alper Yilmaz, Mubarak Shah
LREC
2008
15 years 2 months ago
Towards Formal Interpretation of Semantic Annotation
In this paper we present a novel approach to the incremental incorporation of semantic information in natural language processing which does not fall victim to the notorious probl...
Harry Bunt, Chwhynny Overbeeke
ATAL
2008
Springer
15 years 3 months ago
Sigma point policy iteration
In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...
Michael H. Bowling, Alborz Geramifard, David Winga...
CVPR
2011
IEEE
14 years 9 months ago
Random Field Topic Model for Semantic Region Analysis in Crowded Scenes from Tracklets
In this paper, a Random Field Topic (RFT) model is proposed for semantic region analysis from motions of objects in crowded scenes. Different from existing approaches of learning ...
Bolei Zhou, Xiaogang Wang