Sciweavers

8316 search results - page 201 / 1664
» Web Document Modeling
Sort
View
HT
2009
ACM
15 years 10 months ago
The redocumentation process of computer mediated activity traces: a general framework
The digital world enables the creation of personalized documents. In this paper we are interested in describing a computer mediated activity by a person throughout a semi-automati...
Leila Yahiaoui, Yannick Prié, Zizette Boufa...
LAWEB
2003
IEEE
15 years 9 months ago
On the Evolution of Clusters of Near-Duplicate Web Pages
This paper expands on a 1997 study of the amount and distribution of near-duplicate pages on the World Wide Web. We downloaded a set of 150 million web pages on a weekly basis ove...
Dennis Fetterly, Mark Manasse, Marc Najork
144
Voted
IPM
2006
146views more  IPM 2006»
15 years 4 months ago
Dictionary-based text categorization of chemical web pages
A new dictionary-based text categorization approach is proposed to classify the chemical web pages efficiently. Using a chemistry dictionary, the approach can extract chemistry-re...
Chunyan Liang, Li Guo, Zhaojie Xia, Feng-Guang Nie...
WWW
2003
ACM
16 years 5 months ago
Dynamic maintenance of web indexes using landmarks
Recent work on incremental crawling has enabled the indexed document collection of a search engine to be more synchronized with the changing World Wide Web. However, this synchron...
Lipyeow Lim, Min Wang, Sriram Padmanabhan, Jeffrey...
EUROMICRO
2003
IEEE
15 years 9 months ago
Web Service Engineering with DIWE
A Web service is frequently defined as browser-less access to content on a Web site. The industry’s focus to date has been on providing easy-to-use low-level libraries, tools a...
Engin Kirda, Clemens Kerer, Christopher Krüge...