Search Sciweavers | Sciweavers

301 search results - page 27 / 61

» On the Optimality of Probability Estimation by Random Decisi...

107

Voted

IPCO
2010

125views Optimization» more IPCO 2010»

A Pumping Algorithm for Ergodic Stochastic Mean Payoff Games with Perfect Information

15 years 1 months ago

Download www.mpi-inf.mpg.de

Abstract. We consider two-person zero-sum stochastic mean payoff games with perfect information, or BWR-games, given by a digraph G = (V = VB VW VR, E), with local rewards r : E R...

Endre Boros, Khaled M. Elbassioni, Vladimir Gurvic...

claim paper

Read More »

102

Voted

IBPRIA
2007
Springer

161views Pattern Recognition» more IBPRIA 2007»

Random Forest for Gene Expression Based Cancer Classification: Overlooked Issues

15 years 4 months ago

Download www.ee.oulu.fi

Random forest is a collection (ensemble) of decision trees. It is a popular ensemble technique in pattern recognition. In this article, we apply random forest for cancer classifica...

Oleg Okun, Helen Priisalu

claim paper

Read More »

123

Voted

CIMCA
2008
IEEE

125views Intelligent Agents» more CIMCA 2008»

Tree Exploration for Bayesian RL Exploration

15 years 7 months ago

Download arxiv.org

Research in reinforcement learning has produced algorithms for optimal decision making under uncertainty that fall within two main types. The ﬁrst employs a Bayesian framework, ...

Christos Dimitrakakis

posted by olethros

Read More »

116

Voted

ICML
2007
IEEE

172views Machine Learning» more ICML 2007»

Conditional random fields for multi-agent reinforcement learning

16 years 1 months ago

Download www.machinelearning.org

Conditional random fields (CRFs) are graphical models for modeling the probability of labels given the observations. They have traditionally been trained with using a set of obser...

Xinhua Zhang, Douglas Aberdeen, S. V. N. Vishwanat...

claim paper

Read More »

110

click to vote

CORR
2010
Springer

105views Education» more CORR 2010»

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

14 years 11 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in ﬁnite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

claim paper

Read More »

« Prev « First page 27 / 61 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers