We introduce the ALeRT (Action-dependent Learning Rates with Trends) algorithm that makes two modifications to the learning rate and one change to the exploration rate of traditio...
Maria Cutumisu, Duane Szafron, Michael H. Bowling,...
In this paper we study the use of experts algorithms in a multiagent setting. In this paper we allow agents to use multiple experts and explore different experts algorithms that a...
Serious games are becoming a powerful tool in education. However, there are still open issues needing further research to generalize the use of videogames and game-like simulations...
Javier Torrente, Pablo Moreno-Ger, Baltasar Fern&a...
We present a new family of subgradient methods that dynamically incorporate knowledge of the geometry of the data observed in earlier iterations to perform more informative gradie...
2 Related Works Gaussian mixtures are often used for data modeling in many real-time applications such as video background modeling and speaker direction tracking. The real-time a...