Search Sciweavers | Sciweavers

223 search results - page 32 / 45

» Least-Squares Temporal Difference Learning

click to vote

JMLR
2002

100views more JMLR 2002»

On the Convergence of Optimistic Policy Iteration

14 years 9 months ago

Download www.mit.edu

We consider a finite-state Markov decision problem and establish the convergence of a special case of optimistic policy iteration that involves Monte Carlo estimation of Q-values,...

John N. Tsitsiklis

claim paper

Read More »

click to vote

FLAIRS
2004

106views Artificial Intelligence» more FLAIRS 2004»

On the Pedagogically Guided Paper Recommendation for an Evolving Web-Based Learning System

14 years 11 months ago

Download www.aaai.org

In this paper we discuss the mechanism of a recommender system recommending papers for an evolving web-based learning system. Our system is unique in three aspects. The first is t...

Tiffany Ya Tang, Gordon I. McCalla

claim paper

Read More »

click to vote

ICPR
2006
IEEE

129views computer vision» more ICPR 2006»

Robust Recursive Learning for Foreground Region Detection in Videos with Quasi-Stationary Backgrounds

15 years 10 months ago

Download www.cse.unr.edu

Detecting regions of interest in video sequences is the most important task in many high level video processing applications. In this paper a robust technique based on recursive l...

Alireza Tavakkoli, George Bebis, Mircea Nicolescu

claim paper

Read More »

click to vote

AAAI
2006

161views Intelligent Agents» more AAAI 2006»

Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning

14 years 11 months ago

Download staff.science.uva.nl

Reinforcement learning problems are commonly tackled with temporal difference methods, which attempt to estimate the agent's optimal value function. In most real-world proble...

Shimon Whiteson, Peter Stone

claim paper

Read More »

119

click to vote

CVPR
2009
IEEE

314views Computer Vision» more CVPR 2009»

Learning sign language by watching TV (using weakly aligned subtitles)

16 years 5 months ago

Download www.comp.leeds.ac.uk

The goal of this work is to automatically learn a large number of British Sign Language (BSL) signs from TV broadcasts. We achieve this by using the supervisory information avai...

Patrick Buehler (University of Oxford), Mark Everi...

claim paper

Read More »

« Prev « First page 32 / 45 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers