Sciweavers

498 search results - page 46 / 100
» Robust web content extraction
Sort
View
ACL
2010
14 years 8 months ago
How Many Words Is a Picture Worth? Automatic Caption Generation for News Images
In this paper we tackle the problem of automatic caption generation for news images. Our approach leverages the vast resource of pictures available on the web and the fact that ma...
Yansong Feng, Mirella Lapata
ICTAI
2009
IEEE
15 years 4 months ago
Classifying Sentence-Based Summaries of Web Documents
Text classification categories Web documents in large collections into predefined classes based on their contents. Unfortunately, the classification process can be time-consumi...
Maria Soledad Pera, Yiu-Kai Ng
ICADL
2010
Springer
160views Education» more  ICADL 2010»
15 years 2 months ago
Thesaurus Extension Using Web Search Engines
Maintaining and extending large thesauri is an important challenge facing digital libraries and IT businesses alike. In this paper we describe a method building on and extending ex...
Robert Meusel, Mathias Niepert, Kai Eckert, Heiner...
WWW
2005
ACM
15 years 10 months ago
Browsing fatigue in handhelds: semantic bookmarking spells relief
Focused Web browsing activities such as periodically looking up headline news, weather reports, etc., which require only selective fragments of particular Web pages, can be made m...
Saikat Mukherjee, I. V. Ramakrishnan
ICASSP
2009
IEEE
15 years 4 months ago
Data hiding in hard-copy text documents robust to print, scan and photocopy operations
This paper describes a method for hiding data inside printed text documents that is resilient to print/scan and photocopying operations. Using the principle of channel coding with...
Avinash L. Varna, Shantanu Rane, Anthony Vetro