Sciweavers

ICASSP
2011
IEEE
12 years 8 months ago
Training of error-corrective model for ASR without using audio data
This paper introduces a method to train an error-corrective model for Automatic Speech Recognition (ASR) without using audio data. In existing techniques, it is assumed that suf...
Gakuto Kurata, Nobuyasu Itoh, Masafumi Nishimura
CSL
2002
Springer
13 years 4 months ago
Lightly supervised and unsupervised acoustic model training
The last decade has witnessed substantial progress in speech recognition technology, with today's state-of-the-art systems being able to transcribe unrestricted broadcast news audi...
Lori Lamel, Jean-Luc Gauvain, Gilles Adda
PRIS
2004
13 years 6 months ago
Pattern Recognition Algorithms for Polyphonic Music Transcription
Abstract. The main area of work in computer music related to information systems is known as music information retrieval (MIR). Databases containing musical information can be clas...
Antonio Pertusa, José Manuel Iñesta ...
EVENT
2001
13 years 6 months ago
Multimodal 3-D Tracking and Event Detection via the Particle Filter
Determining the occurrence of an event is fundamental to developing systems that can observe and react to them. Often, this determination is based on collecting video and/or audio...
Dmitry N. Zotkin, Ramani Duraiswami, Larry S. Davi...
ACL
2001
13 years 6 months ago
Processing Broadcast Audio for Information Access
This paper addresses recent progress in speaker-independent, large vocabulary, continuous speech recognition, which has opened up a wide range of near and mid-term applications. O...
Jean-Luc Gauvain, Lori Lamel, Gilles Adda, Martine...
CEAS
2008
Springer
13 years 7 months ago
Analysis of Spectral Parameters of Audio Signals for the Identification of Spam Over IP Telephony
A method is presented which analyses the audio speech data of voice calls and calculates an "acoustic fingerprint". The audio data of the voice calls are compared with e...
Christoph Pörschmann, Heiko Knospe
ISMIS
2005
Springer
13 years 10 months ago
Extracting Emotions from Music Data
Abstract. Music is not only a set of sounds, it evokes emotions, subjectively perceived by listeners. The growing amount of audio data available on CDs and on the Internet wakes up...
Alicja Wieczorkowska, Piotr Synak, Rory A. Lewis, ...
ICMCS
2005
IEEE
13 years 10 months ago
Persistent audio modelling for background determination
This paper is concerned with modelling background audio online to detect foreground sounds in complex audio environments for surveillance and smart home applications. We examine a...
Simon Moncrieff, Svetha Venkatesh, Geoff A. W. Wes...
ICMCS
2006
IEEE
13 years 11 months ago
Mixed Type Audio Classification with Support Vector Machine
Content-based classification of audio data is an important problem for various applications such as overall analysis of audio-visual streams, boundary detection of video story se...
Lei Chen 0002, Sule Gündüz, M. Tamer Ö...
CNSR
2008
IEEE
13 years 11 months ago
Discrete Model to Estimate Lifetime of a Wireless Sensor Network for Audio Storage
Wireless sensor networks (WSNs) can be used to record and store audio data at remote and inaccessible places. However, audio data adds an additional concern to the design of th...
Sajid Hussain, Patrick Drane, Michael Mallinson