This paper examines several different approaches to exploiting structural information in semi-structured document categorization. The methods under consideration are designed for ...
The primary objective of document annotation in whatever form, manual or electronic is to allow those who may not have control to original document to provide personal view on inf...
The rapid growth of the web has been noted and tracked extensively. Recent studies have however documented the dual phenomenon: web pages have small half lives, and thus the web e...
Ziv Bar-Yossef, Andrei Z. Broder, Ravi Kumar, Andr...
Finding accurate information on the web has become a challenge due to the increment in the number of documents available on line. Current search engines retrieve relevant document...
Alejandro Del-Castillo-Escobedo, Manuel Montes-y-G...
We design a schema language that includes channel schemas with capabilities of input, output, and input-output. These schemas may describe documents containing references to operat...