Sciweavers

ICML
1995
IEEE
13 years 8 months ago
Learning with Rare Cases and Small Disjuncts
Systems that learn from examples often create a disjunctive concept definition. Small disjuncts are those disjuncts which cover only a few training examples. The problem with sma...
Gary M. Weiss
ICML
1995
IEEE
13 years 8 months ago
Learning Collection FUsion Strategies for Information Retrieval
In this paper we describe an Information Retrieval problem called collection fusion. The collection fusion problem is to maximize the number of relevant natural language documents...
Geoffrey G. Towell, Ellen M. Voorhees, Narendra Ku...
ICML
1995
IEEE
13 years 8 months ago
Efficient Learning with Virtual Threshold Gates
Wolfgang Maass, Manfred K. Warmuth
ICML
1995
IEEE
14 years 5 months ago
Learning by Observation and Practice: An Incremental Approach for Planning Operator Acquisition
This paper describes an approach to automatically learn planning operators by observing expert solution traces and to further refine the operators through practice in a learning-b...
Xuemei Wang
ICML
1995
IEEE
14 years 5 months ago
Learning Policies for Partially Observable Environments: Scaling Up
Partially observable Markov decision processes (pomdp's) model decision problems in which an agent tries to maximize its reward in the face of limited and/or noisy sensor fee...
Michael L. Littman, Anthony R. Cassandra, Leslie P...
ICML
1995
IEEE
14 years 5 months ago
Learning to Make Rent-to-Buy Decisions with Systems Applications
In the single rent-to-buy decision problem, without a priori knowledge of the amount of time a resource will be used we need to decide when to buy the resource, given that we can ...
P. Krishnan, Philip M. Long, Jeffrey Scott Vitter
ICML
1995
IEEE
14 years 5 months ago
Tracking the Best Expert
Mark Herbster, Manfred K. Warmuth
ICML
1995
IEEE
14 years 5 months ago
Stable Function Approximation in Dynamic Programming
The success ofreinforcement learninginpractical problems depends on the ability to combine function approximation with temporal di erence methods such as value iteration. Experime...
Geoffrey J. Gordon