Sciweavers

486 search results - page 95 / 98
» A Bayesian Framework for Reinforcement Learning
Sort
View
CVPR
2012
IEEE
13 years 12 hour ago
Nonparametric discovery of activity patterns from video collections
We propose a nonparametric framework based on the beta process for discovering temporal patterns within a heterogenous video collection. Starting from quantized local motion descr...
Michael C. Hughes, Erik B. Sudderth
CIKM
2008
Springer
14 years 11 months ago
Active relevance feedback for difficult queries
Relevance feedback has been demonstrated to be an effective strategy for improving retrieval accuracy. The existing relevance feedback algorithms based on language models and vect...
Zuobing Xu, Ram Akella
ICRA
2003
IEEE
165views Robotics» more  ICRA 2003»
15 years 2 months ago
Multi-robot task-allocation through vacancy chains
Existing task allocation algorithms generally do not consider the effects of task interaction, such as interference, but instead assume that tasks are independent. That assumptio...
Torbjørn S. Dahl, Maja J. Mataric, Gaurav S...
JMLR
2006
124views more  JMLR 2006»
14 years 9 months ago
Policy Gradient in Continuous Time
Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...
Rémi Munos
SIGIR
2011
ACM
14 years 12 days ago
Social context summarization
We study a novel problem of social context summarization for Web documents. Traditional summarization research has focused on extracting informative sentences from standard docume...
Zi Yang, Keke Cai, Jie Tang, Li Zhang, Zhong Su, J...