Sciweavers

ECML
2003
Springer
15 years 8 months ago
Self-evaluated Learning Agent in Multiple State Games
Abstract. Most multi-agent reinforcement learning algorithms aim to converge to a Nash equilibrium, but a Nash equilibrium does not necessarily mean a desirable result. On the o...
Koichi Moriyama, Masayuki Numao
IJON
2010
15 years 1 months ago
Hyperparameter learning in probabilistic prototype-based models
We present two approaches to extend Robust Soft Learning Vector Quantization (RSLVQ). This algorithm for nearest prototype classification is derived from an explicit cost functio...
Petra Schneider, Michael Biehl, Barbara Hammer
AAAI
2011
14 years 3 months ago
Fast Newton-CG Method for Batch Learning of Conditional Random Fields
We propose a fast batch learning method for linear-chain Conditional Random Fields (CRFs) based on Newton-CG methods. Newton-CG methods are a variant of Newton's method for high-dime...
Yuta Tsuboi, Yuya Unno, Hisashi Kashima, Naoaki Ok...
ICDM
2006
IEEE
15 years 9 months ago
Mining Complex Time-Series Data by Learning Markovian Models
In this paper, we propose a novel and general approach for time-series data mining. As an alternative to traditional ways of designing a specific algorithm to mine a certain kind of ...
Yi Wang, Lizhu Zhou, Jianhua Feng, Jianyong Wang, ...
ROBIO
2006
IEEE
15 years 9 months ago
Learning Utility Surfaces for Movement Selection
Humanoid robots are highly redundant systems with respect to the tasks they are asked to perform. This redundancy manifests itself in the number of degrees of freedom of the ro...
Matthew Howard, Michael Gienger, Christian Goerick...