Sciweavers

945 search results - page 30 / 189
» Dialog Convergence and Learning
ISNN
2005
Springer
15 years 9 months ago
Enhanced Fuzzy Single Layer Perceptron
Abstract. In this paper, a method of improving the learning time and convergence rate is proposed to exploit the advantages of artificial neural networks and fuzzy theory to neuron...
Kwang-Baek Kim, Sungshin Kim, Young Hoon Joo, Am S...
NIPS
1998
15 years 4 months ago
Gradient Descent for General Reinforcement Learning
A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcement-learning algorithms. These algorithms solve a number ...
Leemon C. Baird III, Andrew W. Moore
STOC
2006
ACM
16 years 3 months ago
Fast convergence to Wardrop equilibria by adaptive sampling methods
We study rerouting policies in a dynamic round-based variant of a well known game theoretic traffic model due to Wardrop. Previous analyses (mostly in the context of selfish routi...
Simon Fischer, Harald Räcke, Berthold Vö...
NIPS
1998
15 years 4 months ago
Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms
In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...
Michael J. Kearns, Satinder P. Singh
FLOPS
2006
Springer
15 years 7 months ago
Convergence in Language Design: A Case of Lightning Striking Four Times in the Same Place
What will a definitive programming language look like? By definitive language I mean a programming language that gives good solutions at its level of abstraction, allowing computer science...
Peter Van Roy