Sciweavers

1419 search results - page 124 / 284
» Approximation Methods for Supervised Learning
Sort
View
162
Voted
PKDD
2010
Springer
164views Data Mining» more  PKDD 2010»
14 years 10 months ago
Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations
Partially observable Markov decision processes (POMDPs) are widely used for planning under uncertainty. In many applications, the huge size of the POMDP state space makes straightf...
Joni Pajarinen, Jaakko Peltonen, Ari Hottinen, Mik...
NIPS
2007
15 years 2 months ago
Incremental Natural Actor-Critic Algorithms
We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...
Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...
190
Voted
CVPR
2010
IEEE
13 years 9 months ago
Abrupt motion tracking via adaptive stochastic approximation Monte Carlo sampling
Robust tracking of abrupt motion is a challenging task in computer vision due to the large motion uncertainty. In this paper, we propose a stochastic approximation Monte Carlo (...
Xiuzhuang Zhou and Yao Lu
92
Voted
JMLR
2010
103views more  JMLR 2010»
14 years 7 months ago
Learning Nonlinear Dynamic Models from Non-sequenced Data
Virtually all methods of learning dynamic systems from data start from the same basic assumption: the learning algorithm will be given a sequence of data generated from the dynami...
Tzu-Kuo Huang, Le Song, Jeff Schneider
117
Voted
ICASSP
2011
IEEE
14 years 4 months ago
Dynamic selection of a speech enhancement method for robust speech recognition in moving motorcycle environment
We present a speech pre-processing scheme (SPPS) for robust speech recognition in the moving motorcycle environment. The SPPS is dynamically adapted during the run-time operation ...
Iosif Mporas, Todor Ganchev, Otilia Kocsis, Nikos ...