Sciweavers

» Crawling the web for structured documents
DAWAK
2001
Springer
15 years 6 months ago
Discovering Web Document Associations for Web Site Summarization
Complex web information structures prevent search engines from providing satisfactory context-sensitive retrieval. We see that in order to overcome this obstacle, it is essential t...
K. Selçuk Candan, Wen-Syan Li
SEMWEB
2007
Springer
15 years 8 months ago
Media Watch on Climate Change: Building and Visualizing Contextualized Information Spaces
This paper presents the "Media Watch on Climate Change", an interactive Web portal that combines a portfolio of semantic services with a visual interface based on tig...
Arno Scharl, Albert Weichselbraun, Alexander Hubma...
ESWS
2007
Springer
15 years 8 months ago
A Unified Approach to Retrieving Web Documents and Semantic Web Data
The Semantic Web seems to be evolving into a property-linked web of RDF data, conceptually divorced from (but physically housed in) the hyperlinked web of HTML documents. We discus...
Trivikram Immaneni, Krishnaprasad Thirunarayan
IADIS
2003
15 years 3 months ago
SPLAT: A System for Self-Plagiarism Detection
This paper presents a system for self-plagiarism detection, SPLAT. The system uses a WebL web spider that crawls through the web sites of the top fifty Computer Science department...
Christian S. Collberg, Stephen G. Kobourov, Joshua...
EP
1998
Springer
15 years 6 months ago
Measuring Structural Similarity Among Web Documents: Preliminary Results
When we describe a Web page informally, we often use phrases like "it looks like a newspaper site", "there are several unordered lists" or "it's just a collection of li...
Isabel F. Cruz, Slava Borisov, Michael A. Marks, T...