Sciweavers

Convergence analysis of online algorithms
ICML
2007
IEEE
15 years 10 months ago
Reinforcement learning by reward-weighted regression for operational space control
Many robot control problems of practical importance, including operational space control, can be reformulated as immediate reward reinforcement learning problems. However, few of ...
Jan Peters, Stefan Schaal
INFOCOM
1998
IEEE
15 years 2 months ago
A Stochastic Approximation Approach for Max-Min Fair Adaptive Rate Control of ABR Sessions with MCRs
The ABR sessions in an ATM network share the bandwidth left over after guaranteeing service to CBR and VBR traffic. Hence the bandwidth available to ABR sessions is randomly varyi...
Santosh Paul Abraham, Anurag Kumar
MA
2010
Springer
14 years 8 months ago
On Monte Carlo methods for Bayesian multivariate regression models with heavy-tailed errors
We consider Bayesian analysis of data from multivariate linear regression models whose errors have a distribution that is a scale mixture of normals. Such models are used to analy...
Vivekananda Roy, James P. Hobert
SODA
1994
ACM
14 years 11 months ago
Optimal Prediction for Prefetching in the Worst Case
Response time delays caused by I/O are a major problem in many systems and database applications. Prefetching and cache replacement methods are attracting renewed attention because...
P. Krishnan, Jeffrey Scott Vitter
SIAMSC
2010
14 years 4 months ago
Optimized Schwarz Waveform Relaxation for the Primitive Equations of the Ocean
In this article we are interested in the derivation of efficient domain decomposition methods for the viscous primitive equations of the ocean. We consider the rotating 3d incompre...
Emmanuel Audusse, Pierre Dreyfuss, Benoit Merlet