Search Sciweavers | Sciweavers

874 search results - page 13 / 175

» Iterative Learning Control - Monotonicity and Optimization

click to vote

ICML
2008
IEEE

105views Machine Learning» more ICML 2008»

Learning all optimal policies with multiple criteria

15 years 10 months ago

Download leon.barrettnexus.com

We describe an algorithm for learning in the presence of multiple criteria. Our technique generalizes previous approaches in that it can learn optimal policies for all linear pref...

Leon Barrett, Srini Narayanan

claim paper

Read More »

click to vote

CDC
2010
IEEE

139views Control Systems» more CDC 2010»

Q-learning and enhanced policy iteration in discounted dynamic programming

14 years 4 months ago

Download web.mit.edu

We consider the classical finite-state discounted Markovian decision problem, and we introduce a new policy iteration-like algorithm for finding the optimal state costs or Q-facto...

Dimitri P. Bertsekas, Huizhen Yu

claim paper

Read More »

100

click to vote

PRIB
2009
Springer

209views Bioinformatics» more PRIB 2009»

Class Prediction from Disparate Biological Data Sources Using an Iterative Multi-Kernel Algorithm

15 years 4 months ago

Download www.enm.bris.ac.uk

For many biomedical modelling tasks a number of diﬀerent types of data may inﬂuence predictions made by the model. An established approach to pursuing supervised learning with ...

Yiming Ying, Colin Campbell, Theodoros Damoulas, M...

claim paper

Read More »

click to vote

SIGIR
2004
ACM

102views Information Technology» more SIGIR 2004»

Document clustering via adaptive subspace iteration

15 years 3 months ago

Download science.kennesaw.edu

Document clustering has long been an important problem in information retrieval. In this paper, we present a new clustering algorithm ASI1, which uses explicitly modeling of the s...

Tao Li, Sheng Ma, Mitsunori Ogihara

claim paper

Read More »

click to vote

CORR
2010
Springer

105views Education» more CORR 2010»

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

14 years 8 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in ﬁnite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

claim paper

Read More »

« Prev « First page 13 / 175 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers