Sciweavers

563 search results - page 33 / 113
» Crawling the web for structured documents
KDD
2007
ACM
Data Mining
Corroborate and learn facts from the web
The web contains lots of interesting factual information about entities, such as celebrities, movies or products. This paper describes a robust bootstrapping approach to corrobora...
Shubin Zhao, Jonathan Betz
WWW
2010
ACM
Structured audio podcasts via web text-to-speech system
Audio podcasting is increasingly present in the educational field and is especially appreciated as a ubiquitous/pervasive tool ("anywhere, anytime, at any pace") for ac...
Giulio Mori, Maria Claudia Buzzi, Marina Buzzi, Ba...
SAMT
2007
Springer
Multimedia
Document Layout Substructure Discovery
Abstract. In this paper we present a system, DoLSuD, for the automatic discovery of relevant substructures in a document layout. DoLSuD, Document Layout Substructure Discovery, ext...
Claudio Andreatta
CIVR
2009
Springer
Image Analysis
Web news categorization using a cross-media document graph
In this paper we propose a multimedia categorization framework that is able to exploit information across different parts of a multimedia document (e.g., a Web page, a PDF, a Micr...
José Iria, Fabio Ciravegna, João Mag...
DKE
2002
Reasoning for Web document associations and its applications in site map construction
Recently, there has been interest in using associations between web pages to provide users with pages relevant to what they are currently viewing. We believe that, to enable intell...
K. Selçuk Candan, Wen-Syan Li