Search Sciweavers | Sciweavers

We provide a provably efficient algorithm for learning Markov Decision Processes (MDPs) with continuous state and action spaces in the online setting. Specifically, we take a mo...

Alexander L. Strehl, Michael L. Littman

COLT
2008
Springer

Competing in the Dark: An Efficient Algorithm for Bandit Linear Optimization

15 years 8 months ago

Download www-stat.wharton.upenn.edu

We introduce an efficient algorithm for the problem of online linear optimization in the bandit setting which achieves the optimal O*(√T) regret. The setting is a natural general...

Jacob Abernethy, Elad Hazan, Alexander Rakhlin

IIE
2008

Exploring Technologies, Materials, and Methods for an Online Foundational Programming Course

15 years 7 months ago

Download www.mii.lt

Introductory computer programming courses are inherently challenging for a variety of reasons. With increased demands for online delivery, the use of effective technologies, materi...

Eman El-Sheikh, John W. Coffey, Laura J. White

ICML
2008
IEEE

Apprenticeship learning using linear programming

16 years 7 months ago

Download www.cs.ualberta.ca

In apprenticeship learning, the goal is to learn a policy in a Markov decision process that is at least as good as a policy demonstrated by an expert. The difficulty arises in tha...

Umar Syed, Michael H. Bowling, Robert E. Schapire
