Sciweavers

8795 search results - page 58 / 1759
» Measuring Generality of Documents
Sort
View
115
Voted
SIGIR
2008
ACM
15 years 3 months ago
Local text reuse detection
Text reuse occurs in many different types of documents and for many different reasons. One form of reuse, duplicate or near-duplicate documents, has been a focus of researchers be...
Jangwon Seo, W. Bruce Croft
129
Voted
CIKM
2011
Springer
14 years 3 months ago
The quality of the XML web
We collect evidence to answer the following question: Is the quality of the XML documents found on the web sufficient to apply XML technology like XQuery, XPath and XSLT? XML coll...
Steven Grijzenhout, Maarten Marx
CICLING
2001
Springer
15 years 8 months ago
Automatic Keyword Extraction Using Domain Knowledge
Documents can be assigned keywords by frequency analysis of the terms found in the document text, which arguably is the primary source of knowledge about the document itself. By in...
Anette Hulth, Jussi Karlgren, Anna Jonsson, Henrik...
147
Voted
DMIN
2006
293views Data Mining» more  DMIN 2006»
15 years 5 months ago
Arabic Text Classification Using N-Gram Frequency Statistics A Comparative Study
This paper presents the results of classifying Arabic text documents using the N-gram frequency statistics technique employing a dissimilarity measure called the "Manhattan di...
Laila Khreisat
122
Voted
APSEC
2002
IEEE
15 years 8 months ago
Embedding XML Processing Toolkit on General Purpose Programming Language
Many methods for XML processing have been proposed in the last few years. One popular approach is to process XML documents by using existing programming languages. Another popular...
Tetsuo Kamina, Tetsuo Tamai