Sciweavers

» Online Dynamic Value System for Machine Learning
NIPS
1996
15 years 10 days ago
Multidimensional Triangulation and Interpolation for Reinforcement Learning
Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...
Scott Davies
ICML
2009
IEEE
15 years 11 months ago
Interactively optimizing information retrieval systems as a dueling bandits problem
We present an on-line learning framework tailored towards real-time learning from observed user behavior in search engines and other information retrieval systems. In particular, ...
Yisong Yue, Thorsten Joachims
ICALT
2005
IEEE
15 years 4 months ago
MALESAbrain for Problem-Based Learning in IT Education
This paper reports MALESAbrain, an intelligent online tool for problem-based learning (PBL) in IT education. The learning model of MALESAbrain is built on the notions of threshold ...
Akcell Chiang, Mohd Sapiyan Baba
ECML
2007
Springer
15 years 5 months ago
Discriminative Sequence Labeling by Z-Score Optimization
We consider a new discriminative learning approach to sequence labeling based on the statistical concept of the Z-score. Given a training set of pairs of hidden-observed ...
Elisa Ricci, Tijl De Bie, Nello Cristianini
ICML
2009
IEEE
15 years 11 months ago
Analytic moment-based Gaussian process filtering
We propose an analytic moment-based filter for nonlinear stochastic dynamic systems modeled by Gaussian processes. Exact expressions for the expected value and the covariance matr...
Marc Peter Deisenroth, Marco F. Huber, Uwe D. Hane...