Sciweavers

472 search results - page 3 / 95
» Linear programming with online learning
Sort
View
NIPS
2007
15 years 1 months ago
Online Linear Regression and Its Application to Model-Based Reinforcement Learning
We provide a provably efficient algorithm for learning Markov Decision Processes (MDPs) with continuous state and action spaces in the online setting. Specifically, we take a mo...
Alexander L. Strehl, Michael L. Littman
87
Voted
COLT
2008
Springer
15 years 1 months ago
Competing in the Dark: An Efficient Algorithm for Bandit Linear Optimization
We introduce an efficient algorithm for the problem of online linear optimization in the bandit setting which achieves the optimal O ( T) regret. The setting is a natural general...
Jacob Abernethy, Elad Hazan, Alexander Rakhlin
83
Voted
IIE
2008
104views more  IIE 2008»
14 years 11 months ago
Exploring Technologies, Materials, and Methods for an Online Foundational Programming Course
Introductory computer programming courses are inherently challenging for a variety of reasons. With increased demands for online delivery, the use of effective technologies, materi...
Eman El-Sheikh, John W. Coffey, Laura J. White
ICML
2008
IEEE
16 years 13 days ago
Apprenticeship learning using linear programming
In apprenticeship learning, the goal is to learn a policy in a Markov decision process that is at least as good as a policy demonstrated by an expert. The difficulty arises in tha...
Umar Syed, Michael H. Bowling, Robert E. Schapire