Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

25

ICASSP
2010
IEEE

favoriteEmaildiscussreport

137views Signal Processing» more ICASSP 2010»

Jointly recognizing multi-speaker conversations

13 years 9 months ago

Jointly recognizing multi-speaker conversations

Download ssli.ee.washington.edu

We suggest an approach to speech recognition where multiple sides of a conversation in a dialog or meeting are processed and decoded jointly rather than independently. We moreover introduce a practical implementation of this approach that demonstrates both language model perplexity and speech recognition word error rate improvements in conversational telephone speech. Speciﬁcally, we show that such beneﬁts can be had if a n-gram language model, in addition to conditioning on immediately preceding words in an utterance, is also allowed to condition on the estimated dialog-act of the immediately preceding utterance of an alternate speaker.

Gang Ji, Jeff Bilmes

Real-time Traffic

Conversational Telephone Speech | ICASSP 2010 | Language Model | Signal Processing | Speech Recognition Word |

claim paper

Related Content

» A multistream ASR framework for BLSTM modeling of conversational speech

» Recognition of Dialogue Acts in Multiparty Meetings Using a Switching DBN

» Multiperson Visual Focus of Attention from Head Pose and Meeting Contextual Cues

» Simultaneous inference of activity pose and object

Post Info
More Details (n/a)

Added	06 Dec 2010
Updated	06 Dec 2010
Type	Conference
Year	2010
Where	ICASSP
Authors	Gang Ji, Jeff Bilmes

Comments (0)