MLMI
2007
Springer
13 years 10 months ago
Modeling Vocal Interaction for Segmentation in Meeting Recognition
Automatic segmentation is an important technology for both automatic speech recognition and automatic speech understanding. In meetings, participants typically vocalize for only a ...
Kornel Laskowski, Tanja Schultz
MLMI
2007
Springer
13 years 10 months ago
Frequency Domain Linear Prediction for QMF Sub-bands and Applications to Audio Coding
Abstract. This paper proposes an analysis technique for wide-band audio applications based on the predictability of the temporal evolution of Quadrature Mirror Filter (QMF) sub-ban...
Petr Motlícek, Sriram Ganapathy, Hynek Herm...
MLMI
2007
Springer
13 years 10 months ago
Spoken Term Detection System Based on Combination of LVCSR and Phonetic Search
Igor Szöke, Michal Fapso, Martin Karafiá...
MLMI
2007
Springer
13 years 10 months ago
Towards an Objective Test for Meeting Browsers: The BET4TQB Pilot Experiment
This paper outlines first the BET method for task-based evaluation of meeting browsers. ‘Observations of interest’ in meetings are empirically determined by neutral observers ...
Andrei Popescu-Belis, Philippe Baudrion, Mike Flyn...
MLMI
2007
Springer
13 years 10 months ago
Czech Text-to-Sign Speech Synthesizer
Zdenek Krnoul, Jakub Kanis, Milos Zelezný, ...
MLMI
2007
Springer
13 years 10 months ago
An Ego-Centric and Tangible Approach to Meeting Indexing and Browsing
Abstract. This article presents an ego-centric approach for indexing and browsing meetings. The method considers two concepts: meetings’ data alignment with personal information ...
Denis Lalanne, Florian Evéquoz, Maurizio Ri...
MLMI
2007
Springer
13 years 10 months ago
Meeting State Recognition from Visual and Aural Labels
In this paper we present a meeting state recognizer based on a combination of multi-modal sensor data in a smart room. Our approach is based on the training of a statistical model ...
Jan Curín, Pascal Fleury, Jan Kleindienst, ...
MLMI
2007
Springer
13 years 10 months ago
Transfer Learning for Tandem ASR Feature Extraction
Joe Frankel, Özgür Çetin, Nelson ...
MLMI
2007
Springer
13 years 10 months ago
Binaural Speech Separation Using Recurrent Timing Neural Networks for Joint F0-Localisation Estimation
A speech separation system is described in which sources are represented in a joint interaural time difference-fundamental frequency (ITD-F0) cue space. Traditionally, recurrent t...
Stuart N. Wrigley, Guy J. Brown
MLMI
2007
Springer
13 years 10 months ago
Automatic Annotation of Dialogue Structure from Simple User Interaction
Abstract. In [1], we presented a method for automatic detection of action items from natural conversation. This method relies on supervised classification techniques that are trai...
Matthew Purver, John Niekrasz, Patrick Ehlen