Sciweavers

MM
2005
ACM
178views Multimedia» more  MM 2005»
13 years 10 months ago
Attention region selection with information from professional digital camera
The attentive region extraction is a challenging issue for semantic interpretation of image and video content. The successful attentive region extraction greatly facilitates image...
Song Liu, Liang-Tien Chia, Deepu Rajan
MM
2005
ACM
126views Multimedia» more  MM 2005»
13 years 10 months ago
A corpus-based singing voice synthesis system for mandarin Chinese
In this paper, the design and implementation of a corpus-based singing voice synthesis (SVS) system for Mandarin Chinese was introduced. The design rules of three corpora for sing...
Cheng-Yuan Lin, Tzu-Ying Lin, Jyh-Shing Roger Jang
MM
2005
ACM
157views Multimedia» more  MM 2005»
13 years 10 months ago
Natural language processing of lyrics
We report experiments on the use of standard natural language processing (NLP) tools for the analysis of music lyrics. A significant amount of music audio has lyrics. Lyrics enco...
Jose P. G. Mahedero, Alvaro Martinez, Pedro Cano, ...
MM
2005
ACM
171views Multimedia» more  MM 2005»
13 years 10 months ago
Semantic manifold learning for image retrieval
Learning the user’s semantics for CBIR involves two different sources of information: the similarity relations entailed by the content-based features, and the relevance relatio...
Yen-Yu Lin, Tyng-Luh Liu, Hwann-Tzong Chen
MM
2005
ACM
192views Multimedia» more  MM 2005»
13 years 10 months ago
Video2Cartoon: generating 3D cartoon from broadcast soccer video
In this demonstration, a prototype system for generating 3D cartoon from broadcast soccer video is proposed. This system takes advantage of computer vision (CV) and computer graph...
Dawei Liang, Yang Liu, Qingming Huang, Guangyu Zhu...
MM
2005
ACM
123views Multimedia» more  MM 2005»
13 years 10 months ago
How speech/text alignment benefits web-based learning
This demonstration presents an integrated web-based synchronized scenario for many-to-one cross-media correlations between speech (an EFL, English as Foreign Language, lecture wit...
Sheng-Wei Li, Hao-Tung Lin, Herng-Yow Chen
MM
2005
ACM
126views Multimedia» more  MM 2005»
13 years 10 months ago
A real-time interactive multi-view video system
With the rapid development of electronic and computing technology, multi-view video is attracting extensive interest recently due to its greatly enhanced viewing experience. In th...
Jian-Guang Lou, Hua Cai, Jiang Li
MM
2005
ACM
209views Multimedia» more  MM 2005»
13 years 10 months ago
Learning an image-word embedding for image auto-annotation on the nonlinear latent space
Latent Semantic Analysis (LSA) has shown encouraging performance for the problem of unsupervised image automatic annotation. LSA conducts annotation by keywords propagation on a l...
Wei Liu, Xiaoou Tang