Search Sciweavers | Sciweavers

27 search results - page 2 / 6

» On the convergence of stochastic dual dynamic programming an...

NIPS
1996

Exploiting Model Uncertainty Estimates for Safe Dynamic Control Learning

13 years 7 months ago

Download www.ri.cmu.edu

Model learning combined with dynamic programming has been shown to be effective for learning control of continuous state dynamic systems. The simplest method assumes the learned mod...

Jeff G. Schneider

ATAL
2008
Springer

Stochastic search methods for Nash equilibrium approximation in simulation-based games

13 years 8 months ago

Download www.seas.upenn.edu

We define the class of games called simulation-based games, in which the payoffs are available as an output of an oracle (simulator), rather than specified analytically or using a...

Yevgeniy Vorobeychik, Michael P. Wellman

JMLR
2006

Structured Prediction, Dual Extragradient and Bregman Projections

13 years 6 months ago

Download www.stat.berkeley.edu

We present a simple and scalable algorithm for maximum-margin estimation of structured output models, including an important class of Markov networks and combinatorial models. We ...

Benjamin Taskar, Simon Lacoste-Julien, Michael I. ...

SDM
2011
SIAM

A Sequential Dual Method for Structural SVMs

12 years 9 months ago

Download www.keerthis.com

In many real world prediction problems the output is a structured object like a sequence or a tree or a graph. Such problems range from natural language processing to computationa...

Shirish Krishnaj Shevade, Balamurugan P., S. Sunda...

IOR
2010

A Shadow Simplex Method for Infinite Linear Programs

13 years 3 months ago

Download www-personal.umich.edu

We present a Simplex-type algorithm, that is, an algorithm that moves from one extreme point of the infinite-dimensional feasible region to another not necessarily adjacent extrem...

Archis Ghate, Dushyant Sharma, Robert L. Smith
