Audio-Visual Affect Recognition through Multi-Stream Fused HMM for HCI

13 years 10 months ago

Download www.ifp.illinois.edu

Advances in computer processing power and emerging algorithms are allowing new ways of envisioning Human Computer Interaction. This paper focuses on the development of a computing algorithm that uses audio and visual sensors to detect and track a user's affective state to aid computer decision making. Using our Multi-stream Fused Hidden Markov Model (MFHMM), we analyzed coupled audio and visual streams to detect 11 cognitive/emotive states. The MFHMM allows the building of an optimal connection among multiple streams according to the maximum entropy principle and the maximum mutual information criterion. Person-independent experimental results from 20 subjects in 660 sequences show that the MFHMM approach performs with an accuracy of 80.61% which outperforms face-only HMM, pitch-only HMM, energy-only HMM, and independent HMM fusion.

Zhihong Zeng, Jilin Tu, Brian Pianfetti, Ming Liu,

Real-time Traffic

Computer Vision | CVPR 2005 | Fused Hidden Markov | Independent Hmm Fusion | Maximum Entropy Principle |

claim paper

Post Info
More Details (n/a)

Added	24 Jun 2010
Updated	24 Jun 2010
Type	Conference
Year	2005
Where	CVPR
Authors	Zhihong Zeng, Jilin Tu, Brian Pianfetti, Ming Liu, Tong Zhang, ZhenQiu Zhang, Thomas S. Huang, Stephen E. Levinson

Comments (0)

Sciweavers

Audio-Visual Affect Recognition through Multi-Stream Fused HMM for HCI

Computer Vision | CVPR 2005 | Fused Hidden Markov | Independent Hmm Fusion | Maximum Entropy Principle |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers