Sciweavers

1227 search results - page 21 / 246
» Learning Rates for Q-Learning
Sort
View
SAGT
2010
Springer
127views Game Theory» more  SAGT 2010»
14 years 8 months ago
On the Rate of Convergence of Fictitious Play
Fictitious play is a simple learning algorithm for strategic games that proceeds in rounds. In each round, the players play a best response to a mixed strategy that is given by the...
Felix Brandt, Felix A. Fischer, Paul Harrenstein
MVA
2002
195views Computer Vision» more  MVA 2002»
14 years 9 months ago
Improved Adaptive Mixture Learning for Robust Video Background Modeling
2 Related Works Gaussian mixtures are often used for data modeling in many real-time applications such as video background modeling and speaker direction tracking. The real-time a...
Dar-Shyang Lee
AAMAS
2011
Springer
14 years 4 months ago
Using focal point learning to improve human-machine tacit coordination
We consider an automated agent that needs to coordinate with a human partner when communication between them is not possible or is undesirable (tacit coordination games). Specifi...
Inon Zuckerman, Sarit Kraus, Jeffrey S. Rosenschei...
SDM
2012
SIAM
252views Data Mining» more  SDM 2012»
13 years 2 days ago
Learning from Heterogeneous Sources via Gradient Boosting Consensus
Multiple data sources containing different types of features may be available for a given task. For instance, users’ profiles can be used to build recommendation systems. In a...
Xiaoxiao Shi, Jean-François Paiement, David...
CORR
2010
Springer
80views Education» more  CORR 2010»
14 years 9 months ago
Multi-path Probabilistic Available Bandwidth Estimation through Bayesian Active Learning
Knowing the largest rate at which data can be sent on an end-to-end path such that the egress rate is equal to the ingress rate with high probability can be very practical when ch...
Frederic Thouin, Mark Coates, Michael Rabbat