Search Sciweavers | Sciweavers

53 search results - page 10 / 11

» Expectation Propagation for approximate Bayesian inference

click to vote

AAAI
1998

175views Intelligent Agents» more AAAI 1998»

Bayesian Q-Learning

13 years 6 months ago

Download www.aaai.org

A central problem in learning in complex environmentsis balancing exploration of untested actions against exploitation of actions that are known to be good. The benefit of explora...

Richard Dearden, Nir Friedman, Stuart J. Russell

claim paper

Read More »

click to vote

ICML
2004
IEEE

167views Machine Learning» more ICML 2004»

Dynamic conditional random fields: factorized probabilistic models for labeling and segmenting sequence data

14 years 5 months ago

Download www.cs.umass.edu

In sequence modeling, we often wish to represent complex interaction between labels, such as when performing multiple, cascaded labeling tasks on the same sequence, or when longra...

Charles A. Sutton, Khashayar Rohanimanesh, Andrew ...

claim paper

Read More »

click to vote

JMLR
2010

125views more JMLR 2010»

Variational methods for Reinforcement Learning

12 years 11 months ago

Download jmlr.csail.mit.edu

We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transit...

Thomas Furmston, David Barber

claim paper

Read More »

click to vote

ALT
2004
Springer

120views Machine Learning» more ALT 2004»

Relative Loss Bounds and Polynomial-Time Predictions for the k-lms-net Algorithm

14 years 1 months ago

Download www.cs.ucl.ac.uk

We consider a two-layer network algorithm. The ﬁrst layer consists of an uncountable number of linear units. Each linear unit is an LMS algorithm whose inputs are ﬁrst “kerne...

Mark Herbster

claim paper

Read More »

click to vote

WSC
2007

152views Modeling And Simulation» more WSC 2007»

New greedy myopic and existing asymptotic sequential selection procedures: preliminary empirical results

13 years 7 months ago

Download www.informs-sim.org

Statistical selection procedures can identify the best of a ﬁnite set of alternatives, where “best” is deﬁned in terms of the unknown expected value of each alternative’...

Stephen E. Chick, Jürgen Branke, Christian Sc...

claim paper

Read More »

« Prev « First page 10 / 11 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers