Sciweavers

388 search results - page 6 / 78
» Convergence Properties of the K-Means Algorithms
Sort
View
JMLR
2010
119views more  JMLR 2010»
14 years 4 months ago
A Convergent Online Single Time Scale Actor Critic Algorithm
Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...
Dotan Di Castro, Ron Meir
87
Voted
TSP
2008
151views more  TSP 2008»
14 years 9 months ago
Convergence Analysis of Reweighted Sum-Product Algorithms
Markov random fields are designed to represent structured dependencies among large collections of random variables, and are well-suited to capture the structure of real-world sign...
Tanya Roosta, Martin J. Wainwright, Shankar S. Sas...
ICML
2009
IEEE
15 years 10 months ago
Gradient descent with sparsification: an iterative algorithm for sparse recovery with restricted isometry property
We present an algorithm for finding an ssparse vector x that minimizes the squareerror y - x 2 where satisfies the restricted isometry property (RIP), with isometric constant 2s ...
Rahul Garg, Rohit Khandekar
IJCAI
2001
14 years 11 months ago
Rational and Convergent Learning in Stochastic Games
This paper investigates the problem of policy learning in multiagent environments using the stochastic game framework, which we briefly overview. We introduce two properties as de...
Michael H. Bowling, Manuela M. Veloso
IFM
2010
Springer
147views Formal Methods» more  IFM 2010»
14 years 7 months ago
Symbolic Model-Checking of Optimistic Replication Algorithms
Abstract. The Operational Transformation (OT) approach, used in many collaborative editors, allows a group of users to concurrently update replicas of a shared object and exchange ...
Hanifa Boucheneb, Abdessamad Imine, Manal Najem