Search Sciweavers | Sciweavers

115

Voted

CHI
2010
ACM

188views Human Computer Interaction» more CHI 2010»

Expressive robots in education: varying the degree of social supportive behavior of a robotic tutor

14 years 6 months ago

Teaching is inherently a social interaction between teacher and student. Despite this knowledge, many educational tools, such as vocabulary training programs, still model the inte...

Martin Saerbeck, Tom Schut, Christoph Bartneck, Ma...

claim paper

Read More »

62

Voted

TIT
2008

76views more TIT 2008»

Improved Risk Tail Bounds for On-Line Algorithms

14 years 9 months ago

Download books.nips.cc

We prove the strongest known bound for the risk of hypotheses selected from the ensemble generated by running a learning algorithm incrementally on the training data. Our result i...

Nicolò Cesa-Bianchi, Claudio Gentile

claim paper

Read More »

90

click to vote

COLT
2004
Springer

87views Machine Learning» more COLT 2004»

Replacing Limit Learners with Equally Powerful One-Shot Query Learners

15 years 3 months ago

Download www2.cs.uregina.ca

Diﬀerent formal learning models address diﬀerent aspects of human learning. Below we compare Gold-style learning—interpreting learning as a limiting process in which the lear...

Steffen Lange, Sandra Zilles

claim paper

Read More »

83

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

14 years 11 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

90

click to vote

ICML
2007
IEEE

180views Machine Learning» more ICML 2007»

Tracking value function dynamics to improve reinforcement learning with piecewise linear function approximation

15 years 10 months ago

Download www.machinelearning.org

Reinforcement learning algorithms can become unstable when combined with linear function approximation. Algorithms that minimize the mean-square Bellman error are guaranteed to co...

Chee Wee Phua, Robert Fitch

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers