Sciweavers

ICASSP
2010
IEEE
13 years 5 months ago
Learning-based auditory encoding for robust speech recognition
Yu-Hsiang Bosco Chiu, Bhiksha Raj, Richard M. Ster...
ICASSP
2010
IEEE
13 years 5 months ago
A new penalty term for the BIC with respect to speaker diarization
In this paper we revise the penalty term of the Bayesian Information Criterion (BIC). Based on our previous approach to penalize each cluster only with its corresponding effective...
Themos Stafylakis, Georgios Tzimiropoulos, Vassili...
ICASSP
2010
IEEE
13 years 5 months ago
Learning deep rhetorical structure for extractive speech summarization
Extractive summarization of conference and lecture speech is useful for online learning and references. We show for the first time that deep(er) rhetorical parsing of conference ...
Justin Jian Zhang, Pascale Fung
ICASSP
2010
IEEE
13 years 5 months ago
Statistical approach to enhancing esophageal speech based on Gaussian mixture models
This paper presents a novel method of enhancing esophageal speech using statistical voice conversion. Esophageal speech is one of the alternative speaking methods for laryngectome...
Hironori Doi, Keigo Nakamura, Tomoki Toda, Hiroshi...
ICASSP
2010
IEEE
13 years 5 months ago
Quantization and compensation in sampled interleaved multi-channel systems
This paper considers the environment of interleaved, multi-channel measurements as arises for example in time-interleaved A/D converters and in distributed sensor networks. Such s...
Shay Maymon, Alan V. Oppenheim
ICASSP
2010
IEEE
13 years 5 months ago
A nullspace analysis of the nuclear norm heuristic for rank minimization
The problem of minimizing the rank of a matrix subject to linear equality constraints arises in applications in machine learning, dimensionality reduction, and control theory, and...
Krishnamurthy Dvijotham, Maryam Fazel
ICASSP
2010
IEEE
13 years 5 months ago
Estimation of a white Gaussian noise in the Short Time Fourier Transform based on the spectral kurtosis of the minimal statistic
In this paper we present a noise level estimator using minimal values of the Short Time Fourier Transform of a signal embedded in a white Gaussian noise. The spectral kurtosis of ...
Fabien Millioz, Nadine Martin
ICASSP
2010
IEEE
13 years 5 months ago
Audio-based nonlinear video diffusion
We propose a novel non-linear video diffusion approach which is able to focus on parts of a video sequence that are relevant for applications in audio-visual analysis. The diffusi...
Anna Llagostera Casanovas, Pierre Vandergheynst
ICASSP
2010
IEEE
13 years 5 months ago
Singing information processing based on singing voice modeling
In this paper, we propose a novel area of research referred to as singing information processing. To shape the concept of this area, we first introduce singing understanding syst...
Masataka Goto, Takeshi Saitou, Tomoyasu Nakano, Hi...
ICASSP
2010
IEEE
13 years 5 months ago
Ranging energy optimization for robust sensor positioning with collaborative anchors
We propose a sensor positioning scheme for a wireless sensor network consisting of beacons as well as collaborative anchors (CA) to help sensors within a prescribed service area t...
Tao Wang, Geert Leus