Search Sciweavers | Sciweavers

2223 search results - page 119 / 445

» Implicit Online Learning

166

AAAI
2006

127views Intelligent Agents» more AAAI 2006»

Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance

15 years 6 months ago

Download robotic.media.mit.edu

As robots become a mass consumer product, they will need to learn new skills by interacting with typical human users. Past approaches have adapted reinforcement learning (RL) to a...

Andrea Lockerd Thomaz, Cynthia Breazeal

claim paper

Read More »

146

Voted

SDM
2007
SIAM

133views Data Mining» more SDM 2007»

Change-Point Detection using Krylov Subspace Learning

15 years 6 months ago

Download siam.org

We propose an eﬃcient algorithm for principal component analysis (PCA) that is applicable when only the inner product with a given vector is needed. We show that Krylov subspace...

Tsuyoshi Idé, Koji Tsuda

claim paper

Read More »

130

click to vote

IPM
2010

106views more IPM 2010»

An asynchronous collaborative search system for online video search

15 years 3 months ago

Download ir.ii.uam.es

There are a number of multimedia tasks and environments that can be collaborative in nature and involve contributions from more than one individual. Examples of such tasks include...

Martin Halvey, David Vallet, David Hannah, Yue Fen...

claim paper

Read More »

145

click to vote

ICML
2008
IEEE

151views Machine Learning» more ICML 2008»

On-line discovery of temporal-difference networks

16 years 5 months ago

Download www.snowelm.com

We present an algorithm for on-line, incremental discovery of temporal-difference (TD) networks. The key contribution is the establishment of three criteria to expand a node in TD...

Takaki Makino, Toshihisa Takagi

claim paper

Read More »

127

click to vote

ALT
2008
Springer

141views Machine Learning» more ALT 2008»

Online Regret Bounds for Markov Decision Processes with Deterministic Transitions

16 years 1 months ago

Download personal.unileoben.ac.at

Abstract. We consider an upper conﬁdence bound algorithm for Markov decision processes (MDPs) with deterministic transitions. For this algorithm we derive upper bounds on the onl...

Ronald Ortner

claim paper

Read More »

« Prev « First page 119 / 445 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers