Sciweavers

MM
2015
ACM
24views Multimedia» more  MM 2015»
9 years 11 months ago
Image Retrieval by Cross-Media Relevance Fusion
How to estimate cross-media relevance between a given query and an unlabeled image is a key question in the MSR-Bing Image Retrieval Challenge. We answer the question by proposing...
Jianfeng Dong, Xirong Li, Shuai Liao, Jieping Xu, ...
MM
2015
ACM
16views Multimedia» more  MM 2015»
9 years 11 months ago
Query-by-Emoji Video Search
This technical demo presents Emoji2Video, a query-by-emoji interface for exploring video collections. Ideogram-based video search and representation presents an opportunity for an...
Spencer Cappallo, Thomas Mensink, Cees G. M. Snoek
MM
2015
ACM
48views Multimedia» more  MM 2015»
9 years 11 months ago
Unsupervised Cosegmentation based on Global Graph Matching
Cosegmentation is defined as the task of segmenting a common object from multiple images. Hitherto, graph matching has been known as a promising approach because of its flexibil...
Takanori Tamanaha, Hideki Nakamaya
MM
2015
ACM
4views Multimedia» more  MM 2015»
9 years 11 months ago
Image2Emoji: Zero-shot Emoji Prediction for Visual Media
We present Image2Emoji, a multi-modal approach for generating emoji labels for an image in a zero-shot manner. Different from existing zero-shot image-to-text approaches, we expl...
Spencer Cappallo, Thomas Mensink, Cees G. M. Snoek
MM
2015
ACM
20views Multimedia» more  MM 2015»
9 years 11 months ago
Bandwidth-aware Prefetching for Proactive Multi-video Preloading and Improved HAS Performance
This paper considers the problem of providing users playing one streaming video the option of instantaneous and seamless playback of alternative videos. Recommendation systems can...
Vengatanathan Krishnamoorthi, Niklas Carlsson, Der...
99
Voted
MM
2015
ACM
14views Multimedia» more  MM 2015»
9 years 11 months ago
Vision-Inertial Hybrid Tracking for Robust and Efficient Augmented Reality on Smartphones
This paper aims at robust and efficient pose tracking for augmented reality on modern smartphones. Existing methods, relying on either vision analysis or motion sensing, are eithe...
Xin Yang, Xun Si, Tangli Xue, Liheng Zhang, Kwang-...
96
Voted
MM
2015
ACM
6views Multimedia» more  MM 2015»
9 years 11 months ago
OmniViewer: Enabling Multi-modal 3D DASH
This paper presents OmniViewer, a multi-modal 3D video streaming system based on Dynamic Adaptive Streaming over HTTP (DASH) standard. OmniViewer allows users to view arbitrary si...
Zhenhuan Gao, Shannon Chen, Klara Nahrstedt
83
Voted
MM
2015
ACM
13views Multimedia» more  MM 2015»
9 years 11 months ago
Exploiting Contextual Information to Enable Efficient Content Delivery for 3D Tele-Immersion Applications
The tradeoff relationship between resource requirement, content complexity, and user satisfaction is magnified when more and more modern 3D Tele-immersive (3DTI) applications with...
Shannon Chen
97
Voted
MM
2015
ACM
11views Multimedia» more  MM 2015»
9 years 11 months ago
Multi-View Visual Recognition of Imperfect Testing Data
A practical yet under-explored problem often encountered by multimedia researchers is the recognition of imperfect testing data, where multiple sensing channels are deployed but i...
Qilin Zhang, Gang Hua