Sciweavers

Search results for "Reinforcement Learning Estimation of Distribution Algorithm" (417 results, page 38 of 84)
ML
2007
ACM
15 years 1 month ago
Density estimation with stagewise optimization of the empirical risk
We consider multivariate density estimation with identically distributed observations. We study a density estimator which is a convex combination of functions in a dictionary and ...
Jussi Klemelä
ECML
2004
Springer
15 years 7 months ago
Conditional Independence Trees
It has been observed that traditional decision trees produce poor probability estimates. In many applications, however, a probability estimation tree (PET) with accurate probabilit...
Harry Zhang, Jiang Su
FLAIRS
2004
15 years 3 months ago
Intelligent Control of Closed-Loop Sedation in Simulated ICU Patients
The intensive care unit is a challenging environment to both patient and caregiver. Continued shortages in staffing, principally in nursing, increase risk to patient and healthcar...
Brett L. Moore, Eric D. Sinzinger, Todd M. Quasny,...
NN
2010
Springer
15 years 23 days ago
Parameter-exploring policy gradients
We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...
Frank Sehnke, Christian Osendorfer, Thomas Rü...
TCS
2010
15 years 21 days ago
Active learning in heteroscedastic noise
We consider the problem of actively learning the mean values of distributions associated with a finite number of options. The decision maker can select which option to generate t...
András Antos, Varun Grover, Csaba Szepesvá...