Sciweavers

TASLP
2010
Audio-Based Semantic Concept Classification for Consumer Video
This paper presents a novel method for automatically classifying consumer video clips based on their soundtracks. We use a set of 25 overlapping semantic classes, chosen ...
Keansub Lee, Daniel P. W. Ellis
TASLP
2010
Multiple Fundamental Frequency Estimation by Modeling Spectral Peaks and Non-Peak Regions
This paper presents a maximum likelihood approach to multiple fundamental frequency (F0) estimation for a mixture of harmonic sound sources, where the power spectrum of a time fra...
Zhiyao Duan, Bryan Pardo, Changshui Zhang
TASLP
2010
Glimpsing IVA: A Framework for Overcomplete/Complete/Undercomplete Convolutive Source Separation
Independent vector analysis (IVA) is a method for separating convolutedly mixed signals that significantly reduces the occurrence of the well-known permutation problem in...
Alireza Masnadi-Shirazi, Wenyi Zhang, Bhaskar D. R...
TASLP
2010
Broadband Source Localization From an Eigenanalysis Perspective
Broadband source localization has several applications ranging from automatic video camera steering to target signal tracking and enhancement through beamforming. Consequ...
Mehrez Souden, Jacob Benesty, Sofiène Affes
TASLP
2010
Gaussian Model-Based Multichannel Speech Presence Probability
The knowledge of the target speech presence probability in a mixture of signals captured by a speech communication system is of paramount importance in several applications includi...
Mehrez Souden, Jingdong Chen, Jacob Benesty, Sofi...
TASLP
2010
Using Reverberation to Improve Range and Elevation Discrimination for Small Array Sound Source Localization
Sound source localization (SSL) is an essential task in many applications involving speech capture and enhancement. As such, speaker localization with microphone arrays has receive...
Flavio Ribeiro, Cha Zhang, Dinei A. F. Florê...
TASLP
2010
Speech Enhancement Using Gaussian Scale Mixture Models
This paper presents a novel probabilistic approach to speech enhancement. Instead of a deterministic logarithmic relationship, we assume a probabilistic relationship between the fr...
Jiucang Hao, Te-Won Lee, Terrence J. Sejnowski
TASLP
2010
Error Approximation and Minimum Phone Error Acoustic Model Estimation
Minimum phone error (MPE) acoustic parameter estimation involves calculation of edit distances (errors) between correct and incorrect hypotheses. In the context of large vocabulary...
Matt Gibson, Thomas Hain
TASLP
2010
Evaluating Source Separation Algorithms With Reverberant Speech
This paper examines the performance of several source separation systems on a speech separation task for which human intelligibility has previously been measured. For anechoic mixt...
Michael I. Mandel, S. Bressler, Barbara G. Shinn-C...
TASLP
2010
Batch and Adaptive PARAFAC-Based Blind Separation of Convolutive Speech Mixtures
We present a frequency-domain technique based on PARAllel FACtor (PARAFAC) analysis that performs multichannel blind source separation (BSS) of convolutive speech mixtures. PARAFAC...
Dimitri Nion, Kleanthis N. Mokios, Nicholas D. Sid...