Sciweavers

ICMCS
2009
IEEE

Boosting multi-modal camera selection with semantic features

13 years 2 months ago
Boosting multi-modal camera selection with semantic features
In this work semantic features are used to improve the results of the camera selection. These semantic features are group action, person action and person speaking. For this purpose low level acoustic and visual features are combined with high level semantic ones. After the feature fusion, a segmentation and classification are performed by Hidden Markov Models. The evaluation shows that an absolute improvement of 6.5% can be achieved. The frame error rate is reduced to 38.1% by using acoustic and all semantic features. The best model using only low level features achieves a frame error rate of 44.6%, which is the best one reported on this data set.
Benedikt Hörnler, Dejan Arsic, Björn Sch
Added 19 Feb 2011
Updated 19 Feb 2011
Type Journal
Year 2009
Where ICMCS
Authors Benedikt Hörnler, Dejan Arsic, Björn Schuller, Gerhard Rigoll
Comments (0)