Layered Hypernetwork Models for Cross-Modal Associative Text and Image Keyword Generation in Multimodal Information Retrieval

15 years 3 months ago

Download bi.snu.ac.kr

Conventional methods for multimodal data retrieval use text-tag based or cross-modal approaches such as tag-image co-occurrence and canonical correlation analysis. Since there are differences of granularity in text and image features, however, approaches based on lower-order relationship between modalities may have limitations. Here, we propose a novel text and image keyword generation method by cross-modal associative learning and inference with multimodal queries. We use a modified hypernetwork model, i.e. layered hypernetworks (LHNs) which consists of the first (lower) layer and the second (upper) layer which has more than two modality-dependent hypernetworks and one modality-integrating hypernetwork, respectively. LHNs learn higher-order associative relationships between text and image modalities by training on an example set. After training, LHNs are used to extend multimodal queries by generating text and image keywords via cross-modal inference, i.e. text-toimage and image-to-te...

JungWoo Ha, Byoung-Hee Kim, Bado Lee, Byoung-Tak Z

Real-time Traffic

Artificial Intelligence | Images | Keyword Generation | Multimodal Queries | PRICAI 2010 |

claim paper

Added	29 Jan 2011
Updated	29 Jan 2011
Type	Journal
Year	2010
Where	PRICAI
Authors	JungWoo Ha, Byoung-Hee Kim, Bado Lee, Byoung-Tak Zhang

Sciweavers

Layered Hypernetwork Models for Cross-Modal Associative Text and Image Keyword Generation in Multimodal Information Retrieval

Artificial Intelligence | Images | Keyword Generation | Multimodal Queries | PRICAI 2010 |

Explore & Download

Productivity Tools

Sciweavers