Sciweavers

2711 search results - page 425 / 543
» Convergence of the Wake-Sleep Algorithm
SODA
2008
ACM
14 years 11 months ago
A fractional model of the border gateway protocol (BGP)
The Border Gateway Protocol (BGP) is the interdomain routing protocol used to exchange routing information between Autonomous Systems (ASes) in the Internet today. While intradoma...
Penny E. Haxell, Gordon T. Wilfong
UAI
2004
14 years 11 months ago
Heuristic Search Value Iteration for POMDPs
We present a novel POMDP planning algorithm called heuristic search value iteration (HSVI). HSVI is an anytime algorithm that returns a policy and a provable bound on its regret w...
Trey Smith, Reid G. Simmons
WSC
2004
14 years 11 months ago
Adaptive Wavelet Neural Network for Prediction of Hourly NOx and NO2 Concentrations
Adaptive neural networks are a powerful tool for predicting air pollution abatement scenarios, but it is often difficult to avoid overfitting during the training of adaptive neural n...
Zhiguo Zhang, Ye San
NIPS
1996
14 years 11 months ago
Transformation Invariance in Pattern Recognition - Tangent Distance and Tangent Propagation
In pattern recognition, statistical modeling, or regression, the amount of data is a critical factor affecting the performance. If the amount of data and computational resources ar...
Patrice Simard, Yann LeCun, John S. Denker, Bernar...
ECAI
2010
Springer
14 years 11 months ago
The Dynamics of Multi-Agent Reinforcement Learning
Infinite-horizon multi-agent control processes with nondeterminism and partial state knowledge have particularly interesting properties with respect to adaptive control, ...
Luke Dickens, Krysia Broda, Alessandra Russo