Sciweavers

TASLP
2010
124views more  TASLP 2010»
12 years 11 months ago
Audio Signal Representations for Indexing in the Transform Domain
Indexing audio signals directly in the transform domain can potentially save a significant amount of computation when working on a large database of signals stored in a lossy compr...
Emmanuel Ravelli, Gaël Richard, Laurent Daude...
TASLP
2010
144views more  TASLP 2010»
12 years 11 months ago
Active Learning With Sampling by Uncertainty and Density for Data Annotations
To solve the knowledge bottleneck problem, active learning has been widely used for its ability to automatically select the most informative unlabeled examples for human annotation...
Jingbo Zhu, Huizhen Wang, Benjamin K. Tsou, Matthe...
TASLP
2010
106views more  TASLP 2010»
12 years 11 months ago
Efficient and Robust Music Identification With Weighted Finite-State Transducers
We present an approach to music identification based on weighted finite-state transducers and Gaussian mixture models, inspired by techniques used in large-vocabulary speech recogn...
Mehryar Mohri, Pedro Moreno, Eugene Weinstein
TASLP
2010
157views more  TASLP 2010»
12 years 11 months ago
Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation
Abstract--We consider inference in a general data-driven object-based model of multichannel audio data, assumed generated as a possibly underdetermined convolutive mixture of sourc...
Alexey Ozerov, Cédric Févotte
TASLP
2010
141views more  TASLP 2010»
12 years 11 months ago
Adaptive Harmonic Spectral Decomposition for Multiple Pitch Estimation
Multiple pitch estimation consists of estimating the fundamental frequencies and saliences of pitched sounds over short time frames of an audio signal. This task forms the basis of...
Emmanuel Vincent, Nancy Bertin, Roland Badeau
TASLP
2010
107views more  TASLP 2010»
12 years 11 months ago
A Robust Method to Extract Talker Azimuth Orientation Using a Large-Aperture Microphone Array
Knowing the orientation of a talker in the focal area of a large-aperture microphone array enables the development of better beamforming algorithms (to obtain higher-quality speech...
Avram Levi, Harvey F. Silverman
TASLP
2010
82views more  TASLP 2010»
12 years 11 months ago
Psychoacoustically Constrained and Distortion Minimized Speech Enhancement
Abstract--This paper considers a psychoacoustically constrained and distortion minimized speech enhancement algorithm. Noise reduction, in general, leads to speech distortion, and ...
Seokhwan Jo, Chang D. Yoo
TASLP
2010
142views more  TASLP 2010»
12 years 11 months ago
Beyond the Narrowband Approximation: Wideband Convex Methods for Under-Determined Reverberant Audio Source Separation
We consider the problem of extracting the source signals from an under-determined convolutive mixture assuming known mixing filters. State-of-the-art methods operate in the time-fr...
M. Kowalski, Emmanuel Vincent, Rémi Gribonv...
TASLP
2010
121views more  TASLP 2010»
12 years 11 months ago
Sequential Organization of Speech in Reverberant Environments by Integrating Monaural Grouping and Binaural Localization
Existing binaural approaches to speech segregation place an exclusive burden on cues related to the location of sound sources in space. These approaches can achieve excellent perfo...
John Woodruff, DeLiang Wang
TASLP
2010
130views more  TASLP 2010»
12 years 11 months ago
Developing Objective Measures of Foreign-Accent Conversion
Various methods have recently appeared to transform foreign-accented speech into its native-accented counterpart. Evaluation of these accent conversion methods requires extensive l...
Daniel Felps, Ricardo Gutierrez-Osuna