Sciweavers

57 search results for "Expected Utility of Content Blocks in Web Content Extraction" (page 1 of 12)
BIS
2006
13 years 6 months ago
Expected Utility of Content Blocks in Web Content Extraction
In this paper we discuss the possible application of new concepts in web content extraction: utility assessment, utility annealing, and dynamic aggregated document generation. Aft...
Marek Kowalkiewicz
WWW
2005
ACM
14 years 5 months ago
Extracting semantic structure of web documents using content and visual information
This work aims to provide a page segmentation algorithm which uses both visual and content information to extract the semantic structure of a web page. The visual information is u...
Rupesh R. Mehta, Pabitra Mitra, Harish Karnick
ER
2008
Springer
13 years 6 months ago
Content Ontology Design Patterns as Practical Building Blocks for Web Ontologies
In this paper we present how to extract and describe emerging content ontology design patterns, and how to compose, specialize and expand them for ontology design, with particular ...
Valentina Presutti, Aldo Gangemi
DEXA
2006
Springer
13 years 6 months ago
Cleaning Web Pages for Effective Web Content Mining
Classifying and mining noise-free web pages will improve on accuracy of search results as well as search speed, and may benefit webpage organization applications (e.g., keyword-bas...
Jing Li, Christie I. Ezeife
KDD
2002
ACM
14 years 5 months ago
Discovering informative content blocks from Web documents
In this paper, we propose a new approach to discover informative contents from a set of tabular documents (or Web pages) of a Web site. Our system, InfoDiscoverer, first partition...
Shian-Hua Lin, Jan-Ming Ho