Sciweavers

5075 search results - page 170 / 1015
» Convergence
Sort
View
COLT
2000
Springer
15 years 8 months ago
Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning
We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process (  ¢¡¤£¦¥§  ), and focus on gradient ascent approache...
Peter L. Bartlett, Jonathan Baxter
ECOOP
2000
Springer
15 years 8 months ago
Using Objects for Next Generation Communication Services
The integration of the telephone network and the internet enables convergence of voice and data services. The explosion of information appliances also provides new service opportun...
Munir Cochinwala
FSTTCS
1993
Springer
15 years 8 months ago
Higher-Order and Semantic Unification
Abstract. We provide a complete system of transformation rules for semantic unification with respect to theories defined by convergent rewrite systems. We show that this standard u...
Nachum Dershowitz, Subrata Mitra
NAA
2000
Springer
88views Mathematics» more  NAA 2000»
15 years 8 months ago
Schwarz Methods for Convection-Diffusion Problems
Abstract. Various variants of Schwarz methods for a singularly perturbed two dimensional stationary convection-diffusion problem are constructed and analysed. The iteration counts,...
H. MacMullen, Eugene O'Riordan, Grigorii I. Shishk...
NIPS
2007
15 years 5 months ago
Estimating divergence functionals and the likelihood ratio by penalized convex risk minimization
We develop and analyze an algorithm for nonparametric estimation of divergence functionals and the density ratio of two probability distributions. Our method is based on a variati...
XuanLong Nguyen, Martin J. Wainwright, Michael I. ...