Search Sciweavers | Sciweavers

1408 search results - page 163 / 282

» Dynamical Tensor Approximation

194

click to vote

NIPS
1996

192views Information Technology» more NIPS 1996»

Multidimensional Triangulation and Interpolation for Reinforcement Learning

15 years 8 months ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

claim paper

Read More »

187

click to vote

PODS
2004
ACM

148views Database» more PODS 2004»

Deterministic Wavelet Thresholding for Maximum-Error Metrics

16 years 6 months ago

Download www.softnet.tuc.gr

Several studies have demonstrated the effectiveness of the wavelet decomposition as a tool for reducing large amounts of data down to compact wavelet synopses that can be used to ...

Minos N. Garofalakis, Amit Kumar

claim paper

Read More »

189

click to vote

ORL
2006

105views more ORL 2006»

Inventory placement in acyclic supply chain networks

15 years 6 months ago

Download www.bschool.nus.edu.sg

The strategic safety stock placement problem is a constrained separable concave minimization problem and so is solvable, in principle, as a sequence of mixed-integer programming p...

Thomas L. Magnanti, Zuo-Jun Max Shen, Jia Shu, Dav...

claim paper

Read More »

193

click to vote

WINE
2009
Springer

171views Economy» more WINE 2009»

The Impact of Social Ignorance on Weighted Congestion Games

16 years 1 months ago

Download www.softlab.ntua.gr

We consider weighted linear congestion games, and investigate how social ignorance, namely lack of information about the presence of some players, affects the inefﬁciency of pure...

Dimitris Fotakis, Vasilis Gkatzelis, Alexis C. Kap...

claim paper

Read More »

205

click to vote

JMLR
2006

124views more JMLR 2006»

Policy Gradient in Continuous Time

15 years 6 months ago

Download hal.inria.fr

Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...

Rémi Munos

claim paper

Read More »

« Prev « First page 163 / 282 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers