157
click to vote
ICML
15 years 7 months ago
1994 IEEE
Conservation of information (COI) popularized by the no free lunch theorem is a great leveler of search algorithms, showing that on average no search outperforms any other. Yet in ...
147
click to vote
ICML
15 years 7 months ago
1994 IEEE
Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning control architectures for embedded agents. Unfortunately all of the theory and much ...
145
click to vote
ICML
15 years 7 months ago
1994 IEEE
We explore algorithms for learning classification procedures that attempt to minimize the cost of misclassifying examples. First, we consider inductive learning of classification ...
144
click to vote
ICML
15 years 7 months ago
1994 IEEE
With the goal of reducing computational costs without sacrificing accuracy, we describe two algorithms to find sets of prototypes for nearest neighbor classification. Here, the te...
143
click to vote
ICML
15 years 7 months ago
1994 IEEE
In the Markov decision process (MDP) formalization of reinforcement learning, a single adaptive agent interacts with an environment defined by a probabilistic transition function....
|