In this paper we explore the relationship between the temporal and rhythmic structure of musical audio signals. Using automatically extracted rhythmic structure we present a rhyth...
Norberto Degara, Aantonio Pena, Matthew E. P. Davi...
Algorithms such as Least Median of Squares (LMedS) and Random Sample Consensus (RANSAC) have been very successful for low-dimensional robust regression problems. However, the comb...
Browsing through collections of audio recordings of conversation nominally relies on the processing of participants’ lexical productions. The evolving verbal and non-verbal cont...
We present a novel discriminative training algorithm for n-gram language models for use in large vocabulary continuous speech recognition. The algorithm uses large margin estimati...
We investigate whether Amazon’s Mechanical Turk (MTurk) service can be used as a reliable method for transcription of spoken language data. Utterances with varying speaker demog...
Matthew Marge, Satanjeev Banerjee, Alexander I. Ru...
Coordinated Multi-Point transmission and relaying are two likely candidates for the upcoming LTE-Advanced standard as both are able to satisfy the ever increasing demands for ubiq...
We present a visual saliency detection method and its applications. The proposed method does not require prior knowledge (learning) or any pre-processing step. Local visual descri...
Vast segments of the frequency spectrum are licensed to specific users for particular applications. These legacy users, however, often under-utilize their designated spectrum seg...
Random projection has been suggested as a means of dimensionality reduction, where the original data are projected onto a subspace using a random matrix. It represents a computati...
Tetsuya Takiguchi, Jeff Bilmes, Mariko Yoshii, Yas...