Search Sciweavers | Sciweavers

ICML
1995
IEEE

131 views, Machine Learning

Learning with Rare Cases and Small Disjuncts

13 years 8 months ago

Systems that learn from examples often create a disjunctive concept definition. Small disjuncts are those disjuncts which cover only a few training examples. The problem with sma...

Gary M. Weiss

ICML
1995
IEEE

110 views, Machine Learning

Learning by Observation and Practice: An Incremental Approach for Planning Operator Acquisition

14 years 6 months ago

Download reference.kfupm.edu.sa

This paper describes an approach to automatically learn planning operators by observing expert solution traces and to further refine the operators through practice in a learning-b...

Xuemei Wang

ICML
1995
IEEE

196 views, Machine Learning

Ant-Q: A Reinforcement Learning Approach to the Traveling Salesman Problem

14 years 6 months ago

Download www.idsia.ch

In this paper we introduce Ant-Q, a family of algorithms which present many similarities with Q-learning (Watkins, 1989), and which we apply to the solution of symmetric and asymm...

Luca Maria Gambardella, Marco Dorigo

ICML
1995
IEEE

213 views, Machine Learning

Learning Policies for Partially Observable Environments: Scaling Up

14 years 6 months ago

Download reference.kfupm.edu.sa

Partially observable Markov decision processes (POMDPs) model decision problems in which an agent tries to maximize its reward in the face of limited and/or noisy sensor fee...

Michael L. Littman, Anthony R. Cassandra, Leslie P...

ICML
2001
IEEE

185 views, Machine Learning

Off-Policy Temporal Difference Learning with Function Approximation

14 years 6 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta
