Sciweavers

TASLP
2008
96 views
13 years 4 months ago
Binaural Tracking of Multiple Moving Sources
This paper addresses the problem of tracking multiple moving sources using binaural input. We observe that binaural cues are strongly correlated with source locations in ...
N. Roman, DeLiang Wang
TASLP
2008
61 views
13 years 4 months ago
Spectral Representations of Nonmodal Phonation
Regions of nonmodal phonation, which exhibit deviations from uniform glottal-pulse periods and amplitudes, occur often in speech and convey information about linguistic c...
Nicolas Malyska, Thomas F. Quatieri
TASLP
2008
89 views
13 years 4 months ago
A Cascaded Broadcast News Highlighter
This paper presents a fully automatic news skimming system which takes a broadcast news audio stream and provides the user with the segmented, structured and highlighted...
Heidi Christensen, Yoshihiko Gotoh, Steve Renals
TASLP
2008
81 views
13 years 4 months ago
Instrument-Specific Harmonic Atoms for Mid-Level Music Representation
Several studies have pointed out the need for accurate mid-level representations of music signals for information retrieval and signal processing purposes. In this paper, we propos...
Pierre Leveau, Emmanuel Vincent, Gaël Richard...
TASLP
2008
150 views
13 years 4 months ago
Automatic Prosodic Event Detection Using Acoustic, Lexical, and Syntactic Evidence
With the advent of prosody annotation standards such as tones and break indices (ToBI), speech technologists and linguists alike have been interested in automatically detecting pro...
Sankaranarayanan Ananthakrishnan, Shrikanth S. Nar...
TASLP
2008
105 views
13 years 4 months ago
Optimizing the Performance of Spoken Language Recognition With Discriminative Training
The performance of a spoken language recognition system is typically formulated to reflect the detection cost and the strategic decision points along the detection-error-tradeoff cur...
Donglai Zhu, Haizhou Li, Bin Ma, Chin-Hui Lee
TASLP
2008
149 views
13 years 4 months ago
Using Articulatory Representations to Detect Segmental Errors in Nonnative Pronunciation
Motivated by potential applications in second-language pedagogy, we present a novel approach to using articulatory information to improve automatic detection of typical p...
Joseph Tepperman, Shrikanth Narayanan
TASLP
2008
102 views
13 years 4 months ago
On the Importance of the Pearson Correlation Coefficient in Noise Reduction
Noise reduction, which aims at estimating clean speech from noisy observations, has attracted a considerable amount of research and engineering attention over the past few decade...
Jacob Benesty, Jingdong Chen, Yiteng Huang
TASLP
2008
82 views
13 years 4 months ago
Union of MDCT Bases for Audio Coding
This paper investigates the use of sparse overcomplete decompositions for audio coding. Audio signals are decomposed over a redundant union of modified discrete cosine transform (M...
Emmanuel Ravelli, Gaël Richard, Laurent Daude...
TASLP
2008
96 views
13 years 4 months ago
Fast Tracing of Acoustic Beams and Paths Through Visibility Lookup
The beam tracing method can be used for the fast tracing of a large number of acoustic paths through a direct lookup of a special tree-like data structure (beam tree) that describe...
Fabio Antonacci, M. Foco, Augusto Sarti, Stefano T...