Sciweavers

340 search results - page 66 / 68
» Kernelized value function approximation for reinforcement le...
Sort
View
ICMCS
2006
IEEE
192views Multimedia» more  ICMCS 2006»
15 years 3 months ago
Classifier Optimization for Multimedia Semantic Concept Detection
In this paper, we present an AUC (i.e., the Area Under the Curve of Receiver Operating Characteristics (ROC)) maximization based learning algorithm to design the classifier for ma...
Sheng Gao, Qibin Sun
RSS
2007
176views Robotics» more  RSS 2007»
14 years 11 months ago
Active Policy Learning for Robot Planning and Exploration under Uncertainty
Abstract— This paper proposes a simulation-based active policy learning algorithm for finite-horizon, partially-observed sequential decision processes. The algorithm is tested i...
Ruben Martinez-Cantin, Nando de Freitas, Arnaud Do...
79
Voted
ECML
2006
Springer
15 years 1 months ago
Prioritizing Point-Based POMDP Solvers
Recent scaling up of POMDP solvers towards realistic applications is largely due to point-based methods such as PBVI, Perseus, and HSVI, which quickly converge to an approximate so...
Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony
IJCNN
2006
IEEE
15 years 3 months ago
Pattern Selection for Support Vector Regression based on Sparseness and Variability
— Support Vector Machine has been well received in machine learning community with its theoretical as well as practical value. However, since its training time complexity is cubi...
Jiyoung Sun, Sungzoon Cho
NIPS
2007
14 years 11 months ago
Multi-task Gaussian Process Prediction
In this paper we investigate multi-task learning in the context of Gaussian Processes (GP). We propose a model that learns a shared covariance function on input-dependent features...
Edwin V. Bonilla, Kian Ming Chai, Christopher K. I...