Sciweavers

563 search results - page 18 / 113
» Crawling the web for structured documents
WSDM
2009
ACM
15 years 8 months ago
The web changes everything: understanding the dynamics of web content
The Web is a dynamic, ever-changing collection of information. This paper explores changes in Web content by analyzing a crawl of 55,000 Web pages, selected to represent different...
Eytan Adar, Jaime Teevan, Susan T. Dumais, Jonatha...
DAS
2004
Springer
15 years 7 months ago
Rule-Based Structural Analysis of Web Pages
Structural analysis of web pages has been proposed several times and for a number of reasons and purposes, such as the re-flowing of standard web pages to fit a smaller PDA screen....
Fabio Vitali, Angelo Di Iorio, Elisa Ventura Campo...
SIGIR
2008
ACM
15 years 1 months ago
Classifiers without borders: incorporating fielded text from neighboring web pages
Accurate web page classification often depends crucially on information gained from neighboring pages in the local web graph. Prior work has exploited the class labels of nearby p...
Xiaoguang Qi, Brian D. Davison
APWEB
2003
Springer
15 years 7 months ago
A Framework for Decentralized Ranking in Web Information Retrieval
Search engines are among the most important applications or services on the web. Most existing successful search engines use global ranking algorithms to generate the ranking of do...
Karl Aberer, Jie Wu
DL
2000
Springer
15 years 6 months ago
Re-engineering structures from Web documents
To realise a wide range of applications (including digital libraries) on the Web, a more structured way of accessing the Web is required, and such a requirement can be facilitated by...
Chuang-Hue Moh, Ee-Peng Lim, Wee Keong Ng