Sciweavers

42 search results - page 8 / 9
» A Laplacian Approach to Multi-Oriented Text Detection in Vid...
Sort
View
ICMCS
2007
IEEE
147views Multimedia» more  ICMCS 2007»
13 years 11 months ago
Alignment of Speech to Highly Imperfect Text Transcriptions
We introduce a novel and inexpensive approach for the temporal alignment of speech to highly imperfect transcripts from automatic speech recognition (ASR). Transcripts are generat...
Alexander Haubold, John R. Kender
TCSV
2011
13 years 6 days ago
Concept-Driven Multi-Modality Fusion for Video Search
—As it is true for human perception that we gather information from different sources in natural and multi-modality forms, learning from multi-modalities has become an effective ...
Xiao-Yong Wei, Yu-Gang Jiang, Chong-Wah Ngo
ICMCS
2005
IEEE
105views Multimedia» more  ICMCS 2005»
13 years 11 months ago
Speech-Based Visual Concept Learning Using Wordnet
Modeling visual concepts using supervised or unsupervised machine learning approaches are becoming increasing important for video semantic indexing, retrieval, and filtering appli...
Xiaodan Song, Ching-Yung Lin, Ming-Ting Sun
MMS
2007
13 years 4 months ago
Automatic lyrics alignment for Cantonese popular music
Abstract From lyrics-display on electronic music players and Karaoke videos to surtitles for live Chinese opera performance, one feature is common to all these everyday functionali...
Chi Hang Wong, Wai Man Szeto, Kin Hong Wong
DAC
2004
ACM
14 years 6 months ago
Proxy-based task partitioning of watermarking algorithms for reducing energy consumption in mobile devices
Digital watermarking is a process that embeds an imperceptible signature or watermark in a digital file containing audio, image, text or video data. The watermark is later used to...
Arun Kejariwal, Sumit Gupta, Alexandru Nicolau, Ni...