Sciweavers

112 search results - page 1 / 23
» Clustering Template Based Web Documents
Sort
View
ECIR
2008
Springer
13 years 5 months ago
Clustering Template Based Web Documents
More and more documents on the World Wide Web are based on templates. On a technical level this causes those documents to have a quite similar source code and DOM tree structure. G...
Thomas Gottron
AAAI
1997
13 years 5 months ago
Template-Based Information Mining from HTML Documents
Tools for mining information from data can create added value for the Internet. As the majority of electronic documents available over the network are in unstructured textual form...
Jane Yung-jen Hsu, Wen-tau Yih
SIGIR
1998
ACM
13 years 8 months ago
Web Document Clustering: A Feasibility Demonstration
Users of Web search engines are often forced to sift through the long ordered list of document “snippets” returned by the engines. The IR community has explored document cluste...
Oren Zamir, Oren Etzioni
POPL
2000
ACM
13 years 8 months ago
A Type System for Dynamic Web Documents
Many interactive Web services use the CGI interface for communication with clients. They will dynamically create HTML documents that are presented to the client who then resumes t...
Anders Sandholm, Michael I. Schwartzbach
ICDM
2002
IEEE
162views Data Mining» more  ICDM 2002»
13 years 9 months ago
Phrase-based Document Similarity Based on an Index Graph Model
Document clustering techniques mostly rely on single term analysis of the document data set, such as the Vector Space Model. To better capture the structure of documents, the unde...
Khaled M. Hammouda, Mohamed S. Kamel