
ICMI 2004, Springer

Multimodal model integration for sentence unit detection

In this paper, we adopt a direct modeling approach that uses conversational gesture cues to detect sentence unit (SU) boundaries in videotaped conversations. We treat SU detection as a classification task: for each inter-word boundary, the classifier decides whether or not an SU boundary is present. In addition to gesture cues, we also draw on prosodic and lexical knowledge sources. In this first investigation, we find that gesture features complement the prosodic and lexical knowledge sources for this task. Using all of the knowledge sources, the model achieves the lowest overall SU detection error rate.
Categories and Subject Descriptors: H.5.1 [Multimedia Information Systems]: Audio and Video Input; H.5.5 [Sound and Music Computing]: Modeling and Signal Analysis; I.2.7 [Natural Language Processing]: Dialog Processing
General Terms: Algorithms, Performance, Experimentation, Languages.
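
The abstract frames SU detection as a binary decision at each inter-word boundary, driven by prosodic, lexical, and gesture knowledge sources. The sketch below illustrates only that framing, not the paper's actual models: the random placeholder features, the direct feature concatenation, and the off-the-shelf logistic-regression classifier are all assumptions standing in for the knowledge sources and models described above.

import numpy as np
from sklearn.linear_model import LogisticRegression

# Hypothetical per-boundary feature vectors; real prosodic, lexical, and gesture
# features would come from the respective front ends.
rng = np.random.default_rng(0)
n_boundaries = 200
prosody = rng.normal(size=(n_boundaries, 6))
lexical = rng.normal(size=(n_boundaries, 4))
gesture = rng.normal(size=(n_boundaries, 3))

# Combine the knowledge sources by simple feature concatenation (the paper's own
# combination scheme is not reproduced here).
X = np.hstack([prosody, lexical, gesture])
y = rng.integers(0, 2, size=n_boundaries)   # placeholder labels: 1 = SU boundary, 0 = none

# A generic binary classifier stands in for the paper's models: one decision
# per inter-word boundary.
clf = LogisticRegression(max_iter=1000).fit(X, y)
su_decisions = clf.predict(X)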
Type: Conference
Year: 2004
Where: ICMI
Authors: Mary P. Harper, Elizabeth Shriberg