Sciweavers

194 search results - page 9 / 39
» Sequence Labeling with Reinforcement Learning and Ranking Al...
GECCO
2009
Springer
14 years 9 months ago
Uncertainty handling CMA-ES for reinforcement learning
The covariance matrix adaptation evolution strategy (CMA-ES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an ada...
Verena Heidrich-Meisner, Christian Igel
WSDM
2012
ACM
13 years 7 months ago
Selecting actions for resource-bounded information extraction using reinforcement learning
Given a database with missing or uncertain content, our goal is to correct and fill the database by extracting specific information from a large corpus such as the Web, and to d...
Pallika H. Kanani, Andrew K. McCallum
CIVR
2007
Springer
15 years 5 months ago
Semantics reinforcement and fusion learning for multimedia streams
Fusion of multimedia streams for enhanced performance is a critical problem for retrieval. However, fusion performance tends to easily overfit the hillclimb set used to learn fus...
Dhiraj Joshi, Milind R. Naphade, Apostol Natsev
CVPR
2010
IEEE
14 years 12 months ago
Label propagation in video sequences
This paper proposes a probabilistic graphical model for the problem of propagating labels in video sequences, also termed the label propagation problem. Given a limited amount of ...
Vijay Badrinarayanan, Fabio Galasso, Roberto Cipol...
ML
2010
ACM
14 years 10 months ago
Relational retrieval using a combination of path-constrained random walks
Scientific literature with rich metadata can be represented as a labeled directed graph. This graph representation enables a number of scientific tasks such as ad hoc retrieval o...
Ni Lao, William W. Cohen