Sciweavers

» Dialog Convergence and Learning
ICML
2001
IEEE
15 years 10 months ago
Off-Policy Temporal Difference Learning with Function Approximation
We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...
Doina Precup, Richard S. Sutton, Sanjoy Dasgupta
ACIIDS
2010
IEEE
15 years 2 months ago
An Unsupervised Learning and Statistical Approach for Vietnamese Word Recognition and Segmentation
There are two main topics in this paper: (i) Vietnamese words are recognized and sentences are segmented into words by using probabilistic models; (ii) the optimum probabilistic mo...
Hieu Le Trung, Vu Le Anh, Kien Le Trung
IJCAI
1997
14 years 11 months ago
An Effective Learning Method for Max-Min Neural Networks
Max and min operations have interesting properties that facilitate the exchange of information between the symbolic and real-valued domains. As such, neural networks that employ m...
Loo-Nin Teow, Kia-Fock Loe
NIPS
1994
14 years 11 months ago
Generalization in Reinforcement Learning: Safely Approximating the Value Function
To appear in: G. Tesauro, D. S. Touretzky and T. K. Leen, eds., Advances in Neural Information Processing Systems 7, MIT Press, Cambridge MA, 1995. A straightforward approach to t...
Justin A. Boyan, Andrew W. Moore
NIPS
1996
14 years 11 months ago
Radial Basis Function Networks and Complexity Regularization in Function Learning
In this paper we apply the method of complexity regularization to derive estimation bounds for nonlinear function estimation using a single hidden layer radial basis function netwo...
Adam Krzyzak, Tamás Linder