Sciweavers

2108 search results - page 336 / 422
» Tracking in Reinforcement Learning
UAI
2003
15 years 1 month ago
On the Convergence of Bound Optimization Algorithms
Many practitioners who use EM and related algorithms complain that they are sometimes slow. When does this happen, and what can be done about it? In this paper, we study the gener...
Ruslan Salakhutdinov, Sam T. Roweis, Zoubin Ghahra...
KDD
2010
ACM
14 years 9 months ago
Exploitation and exploration in a performance based contextual advertising system
The dynamic marketplace in online advertising calls for ranking systems that are optimized to consistently promote and capitalize better performing ads. The streaming nature of on...
Wei Li 0010, Xuerui Wang, Ruofei Zhang, Ying Cui, ...
EMNLP
2011
13 years 11 months ago
Watermarking the Outputs of Structured Prediction with an application in Statistical Machine Translation
We propose a general method to watermark and probabilistically identify the structured outputs of machine learning algorithms. Our method is robust to local editing operations and...
Ashish Venugopal, Jakob Uszkoreit, David Talbot, F...
EDUTAINMENT
2009
Springer
15 years 6 months ago
Game-Like Simulations for Online Adaptive Learning: A Case Study
Serious games are becoming a powerful tool in education. However, there are still open issues needing further research to generalize the use of videogames and game-like simulations...
Javier Torrente, Pablo Moreno-Ger, Baltasar Fern...
IADIS
2003
15 years 1 month ago
Knowledge Acquisition Strategies and Navigation in Hypermedia Learning Environments: The Influence of Instructional Design Prope
In order to understand and enhance the value of new media in education it is necessary to develop criteria for the evaluation of the effectiveness of learning with hypermedia envi...
Mattias Steinke, Thomas Huk, Christian Floto