Sciweavers

94 search results - page 4 / 19
» Combining Structure and Content Similarities for XML Documen...
EDBT
2004
ACM
15 years 9 months ago
Content-Based Routing of Path Queries in Peer-to-Peer Systems
Peer-to-peer (P2P) systems are gaining increasing popularity as a scalable means to share data among a large number of autonomous nodes. In this paper, we consider the case in whic...
Georgia Koloniari, Evaggelia Pitoura
EEE
2005
IEEE
15 years 3 months ago
Learning the Kernel Matrix for XML Document Clustering
The rapid growth of XML adoption has urged for the need of a proper representation for semi-structured documents, where the document structural information has to be taken into ac...
Jianwu Yang, William Kwok-Wai Cheung, Xiaoou Chen
INEX
2005
Springer
15 years 3 months ago
A Flexible Structured-Based Representation for XML Document Mining
This paper reports on the INRIA group’s approach to XML mining while participating in the INEX XML Mining track 2005. We use a flexible representation of XML documents that allo...
Anne-Marie Vercoustre, Mounir Fegas, Saba Gul, Yve...
ECIR
2008
Springer
14 years 11 months ago
Clustering Template Based Web Documents
More and more documents on the World Wide Web are based on templates. On a technical level this causes those documents to have a quite similar source code and DOM tree structure. G...
Thomas Gottron
WEBDB
2004
Springer
15 years 2 months ago
Content and Structure in Indexing and Ranking XML
Rooted in electronic publishing, XML is now widely used for modelling and storing structured text documents. Especially in the WWW, retrieval of XML documents is most useful in co...
Felix Weigel, Holger Meuss, Klaus U. Schulz, Fran...