Search Sciweavers | Sciweavers

114

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 8 months ago

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

145

click to vote

ECOOP
2000
Springer

99views Programming Languages» more ECOOP 2000»

Using Objects for Next Generation Communication Services

15 years 8 months ago

Download www.ifs.uni-linz.ac.at

The integration of the telephone network and the internet enables convergence of voice and data services. The explosion of information appliances also provides new service opportun...

Munir Cochinwala

claim paper

Read More »

124

click to vote

FSTTCS
1993
Springer

92views Software Engineering» more FSTTCS 1993»

Higher-Order and Semantic Unification

15 years 8 months ago

Download www.cs.tau.ac.il

Abstract. We provide a complete system of transformation rules for semantic unification with respect to theories defined by convergent rewrite systems. We show that this standard u...

Nachum Dershowitz, Subrata Mitra

claim paper

Read More »

109

click to vote

NAA
2000
Springer

88views Mathematics» more NAA 2000»

Schwarz Methods for Convection-Diffusion Problems

15 years 8 months ago

Download www.dcu.ie

Abstract. Various variants of Schwarz methods for a singularly perturbed two dimensional stationary convection-diffusion problem are constructed and analysed. The iteration counts,...

H. MacMullen, Eugene O'Riordan, Grigorii I. Shishk...

claim paper

Read More »

139

click to vote

NIPS
2007

128views Information Technology» more NIPS 2007»

Estimating divergence functionals and the likelihood ratio by penalized convex risk minimization

15 years 5 months ago

Download books.nips.cc

We develop and analyze an algorithm for nonparametric estimation of divergence functionals and the density ratio of two probability distributions. Our method is based on a variati...

XuanLong Nguyen, Martin J. Wainwright, Michael I. ...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers