Sciweavers

874 search results - page 13 / 175
» Iterative Learning Control - Monotonicity and Optimization
Sort
View
ICML
2008
IEEE
15 years 10 months ago
Learning all optimal policies with multiple criteria
We describe an algorithm for learning in the presence of multiple criteria. Our technique generalizes previous approaches in that it can learn optimal policies for all linear pref...
Leon Barrett, Srini Narayanan
CDC
2010
IEEE
139views Control Systems» more  CDC 2010»
14 years 4 months ago
Q-learning and enhanced policy iteration in discounted dynamic programming
We consider the classical finite-state discounted Markovian decision problem, and we introduce a new policy iteration-like algorithm for finding the optimal state costs or Q-facto...
Dimitri P. Bertsekas, Huizhen Yu
PRIB
2009
Springer
209views Bioinformatics» more  PRIB 2009»
15 years 4 months ago
Class Prediction from Disparate Biological Data Sources Using an Iterative Multi-Kernel Algorithm
For many biomedical modelling tasks a number of different types of data may influence predictions made by the model. An established approach to pursuing supervised learning with ...
Yiming Ying, Colin Campbell, Theodoros Damoulas, M...
SIGIR
2004
ACM
15 years 3 months ago
Document clustering via adaptive subspace iteration
Document clustering has long been an important problem in information retrieval. In this paper, we present a new clustering algorithm ASI1, which uses explicitly modeling of the s...
Tao Li, Sheng Ma, Mitsunori Ogihara
CORR
2010
Springer
105views Education» more  CORR 2010»
14 years 8 months ago
Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence
We consider model-based reinforcement learning in finite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...
Sarah Filippi, Olivier Cappé, Aurelien Gari...