Sciweavers

CSO
2009
IEEE

Automatic Extraction of Spoken Word in Broadcast Media Language

13 years 11 months ago
Automatic Extraction of Spoken Word in Broadcast Media Language
Compared with the written word, few experts pay more attention to the spoken word because of the difficulty of obtaining spoken corpora. In order to develop and improve the spoken words research, this paper proposes a novel method for automatic extraction spoken words in broadcasting language, and the result is impressive. From analysis of the result, we extracted 3009 spoken words by the model on word usage frequency of spatial distribution, and obtain a correct extraction rate over 85% in part I data and 76.5% in part II respectively. The word usage frequency of spatial distribution model can effectively extract and distinguish the spoken words from broadcast media language.
Yuqiang Zhang, Yu Zou, Wei He, Min Hou, Yonglin Te
Added 20 May 2010
Updated 20 May 2010
Type Conference
Year 2009
Where CSO
Authors Yuqiang Zhang, Yu Zou, Wei He, Min Hou, Yonglin Teng
Comments (0)