Let X be a non-negative random variable and let the conditional distribution of a random variable Y , given X, be Poisson(γ · X), for a parameter γ ≥ 0. We identify a natural...
Many real-world problems are inherently hierarchically structured. The use of this structure in an agent’s policy may well be the key to improved scalability and higher performa...
Let A be a self-adjoint operator acting over a space X endowed with a partition. We give lower bounds on the energy of a mixed state from its distribution in the partition and the...
Abstract—The wavelet entropy (WE) of rest electroencephalogram (EEG) and of event-related potentials (ERP) carries information about the degree of order or disorder associated wi...
Giorgos A. Giannakakis, Nikolaos N. Tsiaparas, Mon...
Policy search is a successful approach to reinforcement learning. However, policy improvements often result in the loss of information. Hence, it has been marred by premature conv...