Sciweavers

» Adaptive record extraction from web pages
ICWE
2009
Springer
15 years 6 months ago
A Layout-Independent Web News Article Contents Extraction Method Based on Relevance Analysis
The traditional Web news article contents extraction methods are time-costly and need much maintenance because they analyze the layout of news pages to generate the wrapp...
Hao Han, Takehiro Tokuda
CIKM
2003
Springer
15 years 5 months ago
Extracting unstructured data from template generated web documents
We propose a novel approach that identifies web page templates and extracts the unstructured data. Extracting only the body of the page and eliminating the template increases the ...
Ling Ma, Nazli Goharian, Abdur Chowdhury, Misun Ch...
ICMCS
2007
IEEE
15 years 6 months ago
Web Page Segmentation Based on Gestalt Theory
Automatic web page segmentation is the basis to adaptive web browsing on mobile devices. It breaks a large page into smaller blocks, in which contents with coherent semantics are ...
Peifeng Xiang, Xin Yang, Yuanchun Shi
EUSFLAT
2009
14 years 9 months ago
Web Usage Mining: users' navigational patterns extraction from web logs using ant-based clustering method
Web Usage Mining is the process of applying data mining techniques to the discovery of usage patterns from data extracted from Web Log files. It mines the secondary data (web logs)...
Kobra Etminani, Mohammad R. Akbarzadeh-Totonchi, N...
TKDE
2008
14 years 11 months ago
Web People Search via Connection Analysis
Nowadays, searches for the web pages of a person with a given name constitute a notable fraction of queries to Web search engines. Such a query would normally return web pages rela...
Dmitri V. Kalashnikov, Zhaoqi Chen, Sharad Mehrotr...