Sciweavers

472 search results - page 3 / 95
» Linear programming with online learning
Sort
View
NIPS
2007
13 years 7 months ago
Online Linear Regression and Its Application to Model-Based Reinforcement Learning
We provide a provably efficient algorithm for learning Markov Decision Processes (MDPs) with continuous state and action spaces in the online setting. Specifically, we take a mo...
Alexander L. Strehl, Michael L. Littman
COLT
2008
Springer
13 years 7 months ago
Competing in the Dark: An Efficient Algorithm for Bandit Linear Optimization
We introduce an efficient algorithm for the problem of online linear optimization in the bandit setting which achieves the optimal O ( T) regret. The setting is a natural general...
Jacob Abernethy, Elad Hazan, Alexander Rakhlin
IIE
2008
104views more  IIE 2008»
13 years 6 months ago
Exploring Technologies, Materials, and Methods for an Online Foundational Programming Course
Introductory computer programming courses are inherently challenging for a variety of reasons. With increased demands for online delivery, the use of effective technologies, materi...
Eman El-Sheikh, John W. Coffey, Laura J. White
ICML
2008
IEEE
14 years 6 months ago
Apprenticeship learning using linear programming
In apprenticeship learning, the goal is to learn a policy in a Markov decision process that is at least as good as a policy demonstrated by an expert. The difficulty arises in tha...
Umar Syed, Michael H. Bowling, Robert E. Schapire