Search Sciweavers | Sciweavers

87

ICML
1995
IEEE

131views Machine Learning» more ICML 1995»

Learning with Rare Cases and Small Disjuncts

15 years 3 months ago

Systems that learn from examples often create a disjunctive concept deﬁnition. Small disjuncts are those disjuncts which cover only a few training examples. The problem with sma...

Gary M. Weiss

claim paper

Read More »

102

click to vote

ICML
1995
IEEE

110views Machine Learning» more ICML 1995»

Learning by Observation and Practice: An Incremental Approach for Planning Operator Acquisition

16 years 16 days ago

Download reference.kfupm.edu.sa

This paper describes an approach to automatically learn planning operators by observing expert solution traces and to further refine the operators through practice in a learning-b...

Xuemei Wang

claim paper

Read More »

136

click to vote

ICML
1995
IEEE

196views Machine Learning» more ICML 1995»

Ant-Q: A Reinforcement Learning Approach to the Traveling Salesman Problem

16 years 16 days ago

Download www.idsia.ch

In this paper we introduce Ant-Q, a family of algorithms which present many similarities with Q-learning (Watkins, 1989), and which we apply to the solution of symmetric and asymm...

Luca Maria Gambardella, Marco Dorigo

claim paper

Read More »

125

click to vote

ICML
1995
IEEE

213views Machine Learning» more ICML 1995»

Learning Policies for Partially Observable Environments: Scaling Up

16 years 16 days ago

Download reference.kfupm.edu.sa

Partially observable Markov decision processes (pomdp's) model decision problems in which an agent tries to maximize its reward in the face of limited and/or noisy sensor fee...

Michael L. Littman, Anthony R. Cassandra, Leslie P...

claim paper

Read More »

106

click to vote

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

16 years 16 days ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers