Search Sciweavers | Sciweavers

ICML
2010
IEEE

167 views | Machine Learning

15 years 7 months ago

In this paper we consider the problem of policy evaluation in reinforcement learning, i.e., learning the value function of a fixed policy, using the least-squares temporal-differe...

Alessandro Lazaric, Mohammad Ghavamzadeh, Ré...

AGI
2008

142 views | Artificial Intelligence

Transfer Learning and Intelligence: an Argument and Approach

15 years 7 months ago

Download www.cs.utexas.edu

In order to claim fully general intelligence in an autonomous agent, the ability to learn is one of the most central capabilities. Classical machine learning techniques have had ma...

Matthew E. Taylor, Gregory Kuhlmann, Peter Stone

IJCNN
2008
IEEE

202 views | Neural Networks

Learning to select relevant perspective in a dynamic environment

16 years 15 days ago

Download www.cs.qub.ac.uk

When an agent observes its environment, there are two important characteristics of the perceived information. One is the relevance of information and the other is redundancy. T...

Zhihui Luo, David A. Bell, Barry McCollum, Qingxia...

EPIA
1995
Springer

110 views | Artificial Intelligence

Using Stochastic Grammars to Learn Robotic Tasks

15 years 9 months ago

Download welcome.isr.ist.utl.pt

Abstract. The paper introduces a reinforcement learning-based methodology for performance improvement of Intelligent Controllers. The translation interfaces of a 3-level Hierarchic...

Pedro U. Lima, George N. Saridis

SIGIR
2003
ACM

116 views | Information Technology

ReCoM: reinforcement clustering of multi-type interrelated data objects

15 years 11 months ago

Download research.microsoft.com

Most existing clustering algorithms cluster highly related data objects such as Web pages and Web users separately. The interrelation among different types of data objects is eith...

Jidong Wang, Hua-Jun Zeng, Zheng Chen, Hongjun Lu,...
