Sciweavers

1340 search results - page 100 / 268
» Kalman Temporal Differences
Sort
View
ML
1998
ACM
136views Machine Learning» more  ML 1998»
15 years 3 months ago
Co-Evolution in the Successful Learning of Backgammon Strategy
Following Tesauro’s work on TD-Gammon, we used a 4000 parameter feed-forward neural network to develop a competitive backgammon evaluation function. Play proceeds by a roll of t...
Jordan B. Pollack, Alan D. Blair
ICONIP
2009
15 years 1 months ago
Tracking in Reinforcement Learning
Reinforcement learning induces non-stationarity at several levels. Adaptation to non-stationary environments is of course a desired feature of a fair RL algorithm. Yet, even if the...
Matthieu Geist, Olivier Pietquin, Gabriel Fricout
123
Voted
MMS
2012
13 years 6 months ago
Privacy-sensitive recognition of group conversational context with sociometers
Recognizing the conversational context in which group interactions unfold has applications in machines that support collaborative work and perform automatic social inference using ...
Dinesh Babu Jayagopi, Taemie Kim, Alex Pentland, D...
140
Voted
ICPR
2008
IEEE
16 years 5 months ago
Spatio-temporal 3D pose estimation and tracking of human body parts using the Shape Flow algorithm
In this contribution we introduce the Shape Flow algorithm (SF), a novel method for spatio-temporal 3D pose estimation of a 3D parametric curve. The SF is integrated into a tracki...
Markus Hahn, Lars Krüger, Christian Wöhl...
SIGCOMM
2009
ACM
15 years 10 months ago
White space networking with wi-fi like connectivity
Networking over UHF white spaces is fundamentally different from conventional Wi-Fi along three axes: spatial variation, temporal variation, and fragmentation of the UHF spectrum....
Paramvir Bahl, Ranveer Chandra, Thomas Moscibroda,...