Sciweavers

417 search results - page 69 / 84
» Reinforcement Learning Estimation of Distribution Algorithm
Sort
View
AAAI
1998
15 years 1 months ago
Bayesian Q-Learning
A central problem in learning in complex environmentsis balancing exploration of untested actions against exploitation of actions that are known to be good. The benefit of explora...
Richard Dearden, Nir Friedman, Stuart J. Russell
IROS
2008
IEEE
142views Robotics» more  IROS 2008»
15 years 6 months ago
Scaffolding on-line segmentation of full body human motion patterns
Abstract— This paper develops an approach for on-line segmentation of whole body human motion patterns during human motion observation and learning. A Hidden Markov Model is used...
Dana Kulic, Yoshihiko Nakamura
GECCO
2010
Springer
200views Optimization» more  GECCO 2010»
15 years 4 months ago
Multivariate multi-model approach for globally multimodal problems
This paper proposes an estimation of distribution algorithm (EDA) aiming at addressing globally multimodal problems, i.e., problems that present several global optima. It can be r...
Chung-Yao Chuang, Wen-Lian Hsu
MLDM
2005
Springer
15 years 5 months ago
Multivariate Discretization by Recursive Supervised Bipartition of Graph
Abstract. In supervised learning, discretization of the continuous explanatory attributes enhances the accuracy of decision tree induction algorithms and naive Bayes classifier. M...
Sylvain Ferrandiz, Marc Boullé
SDM
2010
SIAM
158views Data Mining» more  SDM 2010»
15 years 1 months ago
On the Use of Combining Rules in Relational Probability Trees
A relational probability tree (RPT) is a type of decision tree that can be used for probabilistic classification of instances with a relational structure. Each leaf of an RPT cont...
Daan Fierens