This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...
We present a method for transferring knowledge learned in one task to a related task. Our problem solvers employ reinforcement learning to acquire a model for one task. We then tra...
Lisa Torrey, Trevor Walker, Jude W. Shavlik, Richa...
One of the main research concern in neural networks is to find the appropriate network size in order to minimize the trade-off between overfitting and poor approximation. In this ...
: AHP is proposed to give the importance grade with respect to many items. The comparison value that is the element of a comparison matirx is used to be crisp, however, it is easy ...
This communication deals with data reduction and regression. A set of high dimensional data (e.g., images) usually has only a few degrees of freedom with corresponding variables t...
Matthieu Brucher, Christian Heinrich, Fabrice Heit...