Sciweavers

ICASSP
2009
IEEE
13 years 12 months ago
Contrasting emotion-bearing laughter types in multiparticipant vocal activity detection for meetings
The detection of laughter in conversational interaction presents an important challenge in meeting understanding, important primarily because laughter is predictive of the emotion...
Kornel Laskowski
ICASSP
2009
IEEE
13 years 12 months ago
Multi-modal activity and dominance detection in smart meeting rooms
In this paper a new approach for activity and dominance modeling in meetings is presented. For this purpose low level acoustic and visual features are extracted from audio and vid...
Benedikt Hörnler, Gerhard Rigoll
ICASSP
2009
IEEE
13 years 12 months ago
Robust video fingerprinting based on visual attention regions
This paper presents a robust video fingerprinting based on visual attention regions. Video fingerprints, which are a set of short feature vectors, are unique to video clips and us...
Xing Su, Tiejun Huang, Wen Gao
ICASSP
2009
IEEE
13 years 12 months ago
On the phonetic information in ultrasonic microphone signals
We study the phonetic information in the signal from an ultrasonic “microphone”, a device that emits an ultrasonic wave toward a speaker and receives the reflected, Doppler-s...
Karen Livescu, Bo Zhu, James R. Glass
ICASSP
2009
IEEE
13 years 12 months ago
Energy-efficient graph-based wavelets for distributed coding in Wireless Sensor Networks
This work presents a class of unidirectional lifting-based wavelet transforms for an arbitrary communication graph in a wireless sensor network. These transforms are unidirectiona...
Godwin Shen, Sundeep Pattem, Antonio Ortega
ICASSP
2009
IEEE
13 years 12 months ago
Robust speech dereverberation based on non-negativity and sparse nature of speech spectrograms
This paper presents a blind dereverberation method designed to recover the subband envelope of an original speech signal from its reverberant version. The problem is formulated as...
Hirokazu Kameoka, Tomohiro Nakatani, Takuya Yoshio...
ICASSP
2009
IEEE
13 years 12 months ago
Data-driven lexicon expansion for Mandarin broadcast news and conversation speech recognition
We present a data-driven framework for expanding the lexicon to improve Mandarin broadcast news and conversation speech recognition. The lexicon expansion includes the generation ...
Xin Lei, Wen Wang, Andreas Stolcke
ICASSP
2009
IEEE
13 years 12 months ago
Psychoacoustically constrained and distortion minimized speech enhancement algorithm
A psychoacoustically constrained and distortion minimized speech enhancement algorithm is considered. In general, noise reduction leads to speech distortion, and thus, the goal of...
Seokhwan Jo, Chang D. Yoo
ICASSP
2009
IEEE
13 years 12 months ago
Periodic event detection and recognition in video
Periodicity attracts special attention in human cognition. Hence it is important to consider that in automatic analysis of motion events. This paper presents a method for represen...
E. P. Vivek, Erik Pogalin, Arnold W. M. Smeulders
ICASSP
2009
IEEE
13 years 12 months ago
Extensions of absolute discounting (Kneser-Ney method)
Jesús Andrés-Ferrer, Hermann Ney