Sciweavers

4578 search results - page 578 / 916
» Learning from Multi-source Data
Sort
View
NIPS
2001
15 years 7 months ago
Model-Free Least-Squares Policy Iteration
We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...
Michail G. Lagoudakis, Ronald Parr
UAI
1997
15 years 7 months ago
Update Rules for Parameter Estimation in Bayesian Networks
This paper re-examines the problem of parameter estimation in Bayesian networks with missing values and hidden variables from the perspective of recent work in on-line learning [1...
Eric Bauer, Daphne Koller, Yoram Singer
AR
2007
111views more  AR 2007»
15 years 6 months ago
Acquisition of joint attention through natural interaction utilizing motion cues
Joint attention is one of the most important cognitive functions for the emergence of communication not only between humans but also between humans and robots. In the previous wor...
Hidenobu Sumioka, Koh Hosoda, Yuichiro Yoshikawa, ...
ETS
2002
IEEE
121views Hardware» more  ETS 2002»
15 years 6 months ago
Establishing Connections: Interactivity Factors for a Distance Education Course
Both academic institutions and businesses are exploring a shift from face-to-face instruction to distance learning. However, without the foundation of a systematic instructional d...
Diane Berger Ehrlich
ACL
2007
15 years 7 months ago
Bootstrapping a Stochastic Transducer for Arabic-English Transliteration Extraction
We propose a bootstrapping approach to training a memoriless stochastic transducer for the task of extracting transliterations from an English-Arabic bitext. The transducer learns...
Tarek Sherif, Grzegorz Kondrak