Search Sciweavers | Sciweavers

2108 search results - page 151 / 422

» Tracking in Reinforcement Learning

119

Voted

CVPR
2010
IEEE

292views Computer Vision» more CVPR 2010»

An Online Approach: Learning-Semantic-Scene-by-Tracking and Tracking-by-Learning-Semantic-Scene

15 years 8 months ago

Download www.cis.pku.edu.cn

Learning the knowledge of scene structure and tracking a large number of targets are both active topics of computer vision in recent years, which plays a crucial role in surveilla...

Xuan Song, Xiaowei Shao, Huijing Zhao, Jinshi Cui,...

claim paper

Read More »

142

click to vote

HT
2009
ACM

146views Internet Technology» more HT 2009»

Improving recommender systems with adaptive conversational strategies

15 years 10 months ago

Download www.inf.unibz.it

Conversational recommender systems (CRSs) assist online users in their information-seeking and decision making tasks by supporting an interactive process. Although these processes...

Tariq Mahmood, Francesco Ricci

claim paper

Read More »

100

Voted

ATAL
2007
Springer

108views Intelligent Agents» more ATAL 2007»

Dynamic task allocation within an open service-oriented MAS architecture

15 years 9 months ago

Download www.isys.ucl.ac.be

A MAS architecture consisting of service centers is proposed. Within each service center, a mediator coordinates service delivery by allocating individual tasks to corresponding t...

Ivan Jureta, Stéphane Faulkner, Youssef Ach...

claim paper

Read More »

137

Voted

GECCO
2010
Springer

153views Optimization» more GECCO 2010»

Multi-task evolutionary shaping without pre-specified representations

15 years 6 months ago

Download www.science.uva.nl

Shaping functions can be used in multi-task reinforcement learning (RL) to incorporate knowledge from previously experienced tasks to speed up learning on a new task. So far, rese...

Matthijs Snel, Shimon Whiteson

claim paper

Read More »

131

click to vote

ATAL
2008
Springer

123views Intelligent Agents» more ATAL 2008»

Sigma point policy iteration

15 years 5 months ago

Download web.mit.edu

In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...

Michael H. Bowling, Alborz Geramifard, David Winga...

claim paper

Read More »

« Prev « First page 151 / 422 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers