Sciweavers

LAWEB
2003
IEEE
13 years 9 months ago
Syntactic Similarity of Web Documents
This paper presents and compares two methods for evaluating the syntactic similarity between documents. The first method uses the Patricia tree, constructed from the original doc...
Álvaro R. Pereira Jr., Nivio Ziviani
ERCIMDL
2009
Springer
167 views, Education
13 years 11 months ago
A Compressed Self-indexed Representation of XML Documents
This paper presents a structure we call XML Wavelet Tree (XWT) to represent any XML document in a compressed and self-indexed form. Therefore, any query or procedure that could be ...
Nieves R. Brisaboa, Ana Cerdeira-Pena, Gonzalo Nav...
ICDE
2010
IEEE
251 views, Database
14 years 4 months ago
Viewing a World of Annotations through AnnoVIP
The proliferation of electronic content has notably led to the appearance of large corpora of interrelated structured documents (such as HTML and XML Web pages) and semantic annot...
Konstantinos Karanasos, Spyros Zoupanos
ICDE
2002
IEEE
113 views, Database
14 years 5 months ago
XGRIND: A Query-Friendly XML Compressor
XML documents are extremely verbose since the "schema" is repeated for every "record" in the document. While a variety of compressors are available to address ...
Pankaj M. Tolani, Jayant R. Haritsa