Sciweavers

1118 search results - page 76 / 224
» Relational temporal difference learning
Sort
View
ICPR
2006
IEEE
16 years 2 months ago
Robust Recursive Learning for Foreground Region Detection in Videos with Quasi-Stationary Backgrounds
Detecting regions of interest in video sequences is the most important task in many high level video processing applications. In this paper a robust technique based on recursive l...
Alireza Tavakkoli, George Bebis, Mircea Nicolescu
AAAI
2006
15 years 2 months ago
Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning
Reinforcement learning problems are commonly tackled with temporal difference methods, which attempt to estimate the agent's optimal value function. In most real-world proble...
Shimon Whiteson, Peter Stone
CVPR
2009
IEEE
16 years 8 months ago
Learning sign language by watching TV (using weakly aligned subtitles)
The goal of this work is to automatically learn a large number of British Sign Language (BSL) signs from TV broadcasts. We achieve this by using the supervisory information avai...
Patrick Buehler (University of Oxford), Mark Everi...
IJON
2006
90views more  IJON 2006»
15 years 1 months ago
Reinforcement learning of a simple control task using the spike response model
In this work, we propose a variation of a direct reinforcement learning algorithm, suitable for usage with spiking neurons based on the spike response model (SRM). The SRM is a bi...
Murilo Saraiva de Queiroz, Roberto Coelho de Berr&...
ECTEL
2006
Springer
15 years 5 months ago
Evaluation of Virtual Learning Environments Using Logs and Social Networks
The paper presents an evaluation method for e-learning platforms, based on different types of measurements collected in logs of interactions during learning sessions, and on the an...
Vlad Posea, Dan Mihaila, Stefan Trausan-Matu, Vale...