Search Sciweavers | Sciweavers

1233 search results - page 144 / 247

» Reinforcement learning

AAAI
2007

104 views, Intelligent Agents

Active Imitation Learning

15 years 8 months ago

Download www.cs.washington.edu

Imitation learning, also called learning by watching or programming by demonstration, has emerged as a means of accelerating many reinforcement learning tasks. Previous work has s...

Aaron P. Shon, Deepak Verma, Rajesh P. N. Rao

ECAL
2007
Springer

227 views, Artificial Intelligence

Guided Self-organisation for Autonomous Robot Development

16 years 13 days ago

Download robot.informatik.uni-leipzig.de

The paper presents a method to guide the self-organised development of behaviours of autonomous robots. In earlier publications we demonstrated how to use the homeokinesi...

Georg Martius, J. Michael Herrmann, Ralf Der

ICML
2001
IEEE

185 views, Machine Learning

Off-Policy Temporal Difference Learning with Function Approximation

16 years 7 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

ICML
2005
IEEE

137 views, Machine Learning

Learning to compete, compromise, and cooperate in repeated general-sum games

16 years 7 months ago

Download www.mit.edu

Learning algorithms often obtain relatively low average payoffs in repeated general-sum games between other learning agents due to a focus on myopic best-response and one-shot Nas...

Jacob W. Crandall, Michael A. Goodrich

HT
2009
ACM

146 views, Internet Technology

Improving recommender systems with adaptive conversational strategies

16 years 24 days ago

Download www.inf.unibz.it

Conversational recommender systems (CRSs) assist online users in their information-seeking and decision making tasks by supporting an interactive process. Although these processes...

Tariq Mahmood, Francesco Ricci
