Search Sciweavers | Sciweavers

We introduce an efficient algorithm for the problem of online linear optimization in the bandit setting which achieves the optimal O ( T) regret. The setting is a natural general...

Jacob Abernethy, Elad Hazan, Alexander Rakhlin

claim paper

Read More »

123

Voted

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Sensitive Discount Optimality: Unifying Discounted and Average Reward Reinforcement Learning

16 years 2 months ago

Download reference.kfupm.edu.sa

Research in reinforcementlearning (RL)has thus far concentrated on two optimality criteria: the discounted framework, which has been very well-studied, and the averagereward frame...

Sridhar Mahadevan

claim paper

Read More »

106

Voted

DIGITEL
2008
IEEE

234views Artificial Intelligence» more DIGITEL 2008»

Using Humanoid Robots as Instructional Media in Elementary Language Education

15 years 8 months ago

Download www.ncu.edu.tw

As robot technologies have developed rapidly, many researchers have tried to use robots to support education. Studies have shown that robots can help students develop problem-solv...

Gwo-Dong Chen, Chih-Wei Chang

claim paper

Read More »

« Prev « First page 123 / 1469 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers