Sciweavers

1118 search results - page 74 / 224
» Relational temporal difference learning
Sort
View
JIIS
2002
114views more  JIIS 2002»
15 years 1 months ago
A Dynamic Probabilistic Model to Visualise Topic Evolution in Text Streams
Abstract. We propose a novel probabilistic method, based on latent variable models, for unsupervised topographic visualisation of dynamically evolving, coherent textual information...
Ata Kabán, Mark Girolami
ICML
2000
IEEE
16 years 2 months ago
Eligibility Traces for Off-Policy Policy Evaluation
Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...
Doina Precup, Richard S. Sutton, Satinder P. Singh
IUI
2005
ACM
15 years 7 months ago
Adaptive teaching strategy for online learning
Finding the optimal teaching strategy for an individual student is difficult even for an experienced teacher. Identifying and incorporating multiple optimal teaching strategies fo...
Jungsoon P. Yoo, Cen Li, Chrisila C. Pettey
IROS
2007
IEEE
110views Robotics» more  IROS 2007»
15 years 7 months ago
From primitive behaviors to goal-directed behavior using affordances
— In this paper, we studied how a mobile robot equipped with a 3D laser scanner can start from primitive behaviors and learn to use them to achieve goal-directed behaviors. For t...
Mehmet Remzi Dogar, Maya Cakmak, Emre Ugur, Erol S...
AMFG
2005
IEEE
203views Biometrics» more  AMFG 2005»
15 years 7 months ago
Facial Expression Analysis Using Nonlinear Decomposable Generative Models
We present a new framework to represent and analyze dynamic facial motions using a decomposable generative model. In this paper, we consider facial expressions which lie on a one d...
Chan-Su Lee, Ahmed M. Elgammal