Deriving image-text document surrogates to optimize cognition

13 years 11 months ago

Download www.ecologylab.net

The representation of information collections needs to be optimized for human cognition. While documents often include rich visual components, collections, including personal collections and those generated by search engines, are typically represented by lists of text-only surrogates. By concurrently invoking complementary components of human cognition, combined image-text surrogates will help people to more effectively see, understand, think about, and remember an information collection. This research develops algorithmic methods that use the structural context of images in HTML documents to associate meaningful text and thus derive combined image-text surrogates. Our algorithm first recognizes which documents consist essentially of informative and multimedia content. Then, the algorithm recognizes the informative sub-trees within each such document, discards advertisements and navigation, and extracts images with contextual descriptions. Experimental results demonstrate the algorith...

Eunyee Koh, Andruid Kerne

Real-time Traffic

DOCENG 2009 | Document Analysis | Human Cognition | Image-text Surrogates | Text-only Surrogates |

claim paper

Post Info
More Details (n/a)

Added	28 May 2010
Updated	28 May 2010
Type	Conference
Year	2009
Where	DOCENG
Authors	Eunyee Koh, Andruid Kerne

Comments (0)

Sciweavers

Deriving image-text document surrogates to optimize cognition

DOCENG 2009 | Document Analysis | Human Cognition | Image-text Surrogates | Text-only Surrogates |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers