In many pattern recognition tasks, given some input data and a family of models, the “best” model is defined as the one which maximizes the likelihood of the data given the m...
Tara N. Sainath, Dimitri Kanevsky, Bhuvana Ramabha...
— A method for segmentation and recognition of human body behavior data is proposed. Recognition of human body movements is getting larger interests in robotic research field, s...
This paper focuses on the integration of multimodal features for sport video structure analysis. The method relies on a statistical model which takes into account both the shot co...
We examine in some detail Mel Frequency Cepstral Coefficients (MFCCs) - the dominant features used for speech recognition - and investigate their applicability to modeling music. ...
A principal problem in speech recognition is distinguishing between words and phrases that sound similar but have different meanings. Speech recognition programs produce a list of...
Henry Lieberman, Alexander Faaborg, Waseem Daher, ...