We consider approximate policy evaluation for finite state and action Markov decision processes (MDPs) in the off-policy learning context and with the simulation-based least square...
The research presented in this paper is focused on global tempo transformations of music performances. We are investigating the problem of how a performance played at a particular...
This paper introduces a system for real-time incremental learning in a call-centre environment. The classifier used is a Support Vector Machine (SVM) and it is applied to telepho...
Donn Morrison, Ruili Wang, W. L. Xu, Liyanage C. D...
A novel way to simulate Turing Machines (TMs) by Artificial Neural Networks (ANNs) is proposed. We claim that the proposed simulation is in agreement with the correct interpretatio...
We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decisio...