Search Sciweavers | Sciweavers

168 search results - page 5 / 34

» Optimism in Reinforcement Learning Based on Kullback-Leibler...


PRL
2007

138 views

Ent-Boost: Boosting using entropy measures for robust object detection

14 years 9 months ago

Download satoh-lab.ex.nii.ac.jp

Recently, boosting has come to be used widely in object-detection applications because of its impressive performance in both speed and accuracy. However, learning weak classifier...

Duy-Dinh Le, Shin'ichi Satoh


CVPR
2008
IEEE

240 views, Computer Vision

Regression from patch-kernel

15 years 11 months ago

Download www.lv-nus.org

In this paper, we present a patch-based regression framework for addressing the human age and head pose estimation problems. Firstly, each image is encoded as an ensemble of order...

Shuicheng Yan, Xi Zhou, Ming Liu, Mark Hasegawa-Jo...


AI
2006
Springer

103 views, Artificial Intelligence

Trace Equivalence Characterization Through Reinforcement Learning

15 years 1 months ago

Download www2.ift.ulaval.ca

In the context of probabilistic verification, we provide a new notion of trace-equivalence divergence between pairs of Labelled Markov processes. This divergence corresponds to the...

Josee Desharnais, François Laviolette, Kris...


ECML
2004
Springer

112 views, Machine Learning

Convergence and Divergence in Standard and Averaging Reinforcement Learning

15 years 2 months ago

Download igitur-archive.library.uu.nl

Although tabular reinforcement learning (RL) methods have been proved to converge to an optimal policy, the combination of particular conventional reinforcement learning techniques...

Marco Wiering


IWLCS
2005
Springer

161 views, Machine Learning

Counter Example for Q-Bucket-Brigade Under Prediction Problem

15 years 3 months ago

Download www.cs.bham.ac.uk

Aiming to clarify the convergence or divergence conditions for Learning Classifier System (LCS), this paper explores: (1) an extreme condition where the reinforcement process of ...

Atsushi Wada, Keiki Takadama, Katsunori Shimohara

