Detecting the time of occurrence of an acoustic event (for instance, a cheer) embedded in a longer soundtrack is useful and important for applications such as search and retrieval...
Keansub Lee, Daniel P. W. Ellis, Alexander C. Loui
In this paper we develop an algorithm for action recognition and localization in videos. The algorithm uses a figurecentric visual word representation. Different from previous ap...
The paper explores a very simple agent design method called Q-decomposition, wherein a complex agent is built from simpler subagents. Each subagent has its own reward function and...
When modeling high-dimensional richly structured data, it is often the case that the distribution defined by the Deep Boltzmann Machine (DBM) has a rough energy landscape with man...
Ensemble learning is a variational Bayesian method in which an intractable distribution is approximated by a lower-bound. Ensemble learning results in models with better generaliz...