Sciweavers

39 search results - page 2 / 8
» A densitometric approach to web page segmentation
Sort
View
ICIW
2008
IEEE
13 years 11 months ago
Web Contents Tracking by Learning of Page Grammars
A significant fraction of Web data is available only for short periods of time. We consider methods to keep track and to record such dynamic information automatically. The main p...
Dirk Kukulenz, Christoph Reinke, Nils Hoeller
WWW
2008
ACM
14 years 6 months ago
Web page sectioning using regex-based template
This work aims to provide a novel, site-specific web page segmentation and section importance detection algorithm, which leverages structural, content, and visual information. The...
Rupesh R. Mehta, Amit Madaan
IPM
2006
146views more  IPM 2006»
13 years 5 months ago
Dictionary-based text categorization of chemical web pages
A new dictionary-based text categorization approach is proposed to classify the chemical web pages efficiently. Using a chemistry dictionary, the approach can extract chemistry-re...
Chunyan Liang, Li Guo, Zhaojie Xia, Feng-Guang Nie...
WWW
2004
ACM
14 years 6 months ago
Learning block importance models for web pages
Some previous works show that a web page can be partitioned to multiple segments or blocks, and usually the importance of those blocks in a page is not equivalent. Also, it is pro...
Ruihua Song, Haifeng Liu, Ji-Rong Wen, Wei-Ying Ma
ICDAR
2003
IEEE
13 years 10 months ago
Two Approaches for Text Segmentation in Web Images
There is a significant need to recognise the text in images on web pages, both for effective indexing and for presentation by non-visual means (e.g., audio). This paper presents a...
Dimosthenis Karatzas, Apostolos Antonacopoulos