Search Sciweavers | Sciweavers

ICML
2006
IEEE

256 views | Machine Learning

Automatic basis function construction for approximate dynamic programming and reinforcement learning

13 years 11 months ago

We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...

Philipp W. Keller, Shie Mannor, Doina Precup

ICML
1999
IEEE

114 views | Machine Learning

Simple DFA are Polynomially Probably Exactly Learnable from Simple Examples

14 years 6 months ago

Download www.cs.iastate.edu

Efficient learning of DFA is a challenging research problem in grammatical inference. Both exact and approximate (in the PAC sense) identifiability of DFA from examples is known to b...

Rajesh Parekh, Vasant Honavar

ICML
1995
IEEE

196 views | Machine Learning

Ant-Q: A Reinforcement Learning Approach to the Traveling Salesman Problem

14 years 6 months ago

Download www.idsia.ch

In this paper we introduce Ant-Q, a family of algorithms which present many similarities with Q-learning (Watkins, 1989), and which we apply to the solution of symmetric and asymm...

Luca Maria Gambardella, Marco Dorigo

ICML
1989
IEEE

103 views | Machine Learning

Uncertainty Based Selection of Learning Experiences

13 years 9 months ago

Download www.cs.technion.ac.il

The training experiences needed by a learning system may be selected by either an external agent or the system itself. We show that knowledge of the current state of the learner...

Paul D. Scott, Shaul Markovitch
