Sciweavers

94 search results - page 5 / 19
» Combining Structure and Content Similarities for XML Documen...
Sort
View
ICDE
2009
IEEE
156views Database» more  ICDE 2009»
16 years 1 months ago
Distributed Structural Relaxation of XPath Queries
Due to the structural heterogeneity of XML, queries are often interpreted approximately. This is achieved by relaxing the query and ranking the results based on their relevance to ...
Georgia Koloniari, Evaggelia Pitoura
WWW
2007
ACM
16 years 9 days ago
A new suffix tree similarity measure for document clustering
In this paper, we propose a new similarity measure to compute the pairwise similarity of text-based documents based on suffix tree document model. By applying the new suffix tree ...
Hung Chim, Xiaotie Deng
ICDM
2002
IEEE
162views Data Mining» more  ICDM 2002»
15 years 4 months ago
Phrase-based Document Similarity Based on an Index Graph Model
Document clustering techniques mostly rely on single term analysis of the document data set, such as the Vector Space Model. To better capture the structure of documents, the unde...
Khaled M. Hammouda, Mohamed S. Kamel
COLING
2000
15 years 1 months ago
XML and Multilingual Document Authoring: Convergent Trends
Typical approaches to XML authoring view a XML document as a mixture of structure (the tags) and surface (text between the tags). We advocate a radical approach where the surface ...
Marc Dymetman, Veronika Lux, Aarne Ranta
TDM
2004
128views Database» more  TDM 2004»
15 years 1 months ago
Processing Content-And-Structure Queries for XML Retrieval
Document-centric XML collections contain text-rich documents, marked up with XML tags. The tags add lightweight semantics to the text. Querying such collections calls for a hybrid...
Börkur Sigurbjörnsson, Jaap Kamps, Maart...