Search Sciweavers | Sciweavers

127 search results - page 16 / 26

» A linear approximation method for the Shapley value

click to vote

UAI
2004

195views Artificial Intelligence» more UAI 2004»

Solving Factored MDPs with Continuous and Discrete Variables

14 years 11 months ago

Download www.cs.pitt.edu

Although many real-world stochastic planning problems are more naturally formulated by hybrid models with both discrete and continuous variables, current state-of-the-art methods ...

Carlos Guestrin, Milos Hauskrecht, Branislav Kveto...

claim paper

Read More »

103

click to vote

ICML
2007
IEEE

179views Machine Learning» more ICML 2007»

A kernel path algorithm for support vector machines

15 years 10 months ago

Download www.machinelearning.org

The choice of the kernel function which determines the mapping between the input space and the feature space is of crucial importance to kernel methods. The past few years have se...

Gang Wang, Dit-Yan Yeung, Frederick H. Lochovsky

claim paper

Read More »

click to vote

CDC
2010
IEEE

139views Control Systems» more CDC 2010»

Q-learning and enhanced policy iteration in discounted dynamic programming

14 years 4 months ago

Download web.mit.edu

We consider the classical finite-state discounted Markovian decision problem, and we introduce a new policy iteration-like algorithm for finding the optimal state costs or Q-facto...

Dimitri P. Bertsekas, Huizhen Yu

claim paper

Read More »

103

click to vote

SIAMIS
2008

174views more SIAMIS 2008»

Efficient Reconstruction of Piecewise Constant Images Using Nonsmooth Nonconvex Minimization

14 years 9 months ago

Download www.cmla.ens-cachan.fr

We consider the restoration of piecewise constant images where the number of the regions and their values are not fixed in advance, with a good difference of piecewise constant val...

Mila Nikolova, Michael K. Ng, Shuqin Zhang, Wai-Ki...

claim paper

Read More »

click to vote

ICML
2008
IEEE

117views Machine Learning» more ICML 2008»

Sample-based learning and search with permanent and transient memories

15 years 10 months ago

Download www.cs.ualberta.ca

We present a reinforcement learning architecture, Dyna-2, that encompasses both samplebased learning and sample-based search, and that generalises across states during both learni...

David Silver, Martin Müller 0003, Richard S. ...

claim paper

Read More »

« Prev « First page 16 / 26 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers