Sciweavers

2677 search results - page 241 / 536
» Extracting Structured Data from Web Pages
Sort
View
123
Voted
NSDI
2010
15 years 4 months ago
The Architecture and Implementation of an Extensible Web Crawler
Many Web services operate their own Web crawlers to discover data of interest, despite the fact that largescale, timely crawling is complex, operationally intensive, and expensive...
Jonathan M. Hsieh, Steven D. Gribble, Henry M. Lev...
133
Voted
SIGCSE
2006
ACM
132views Education» more  SIGCSE 2006»
15 years 9 months ago
An interactive tutorial system for Java
interactive teaching materials, primarily because of its integration with the web through the applet mechanism. The 1997 and 1998 ITiCSE conferences convened working groups to deve...
Eric Roberts
WWW
2008
ACM
16 years 4 months ago
Networked graphs: a declarative mechanism for SPARQL rules, SPARQL views and RDF data integration on the web
Easy reuse and integration of declaratively described information in a distributed setting is one of the main motivations for building the Semantic Web. Despite of this claim, reu...
Simon Schenk, Steffen Staab
126
Voted
ACL
2006
15 years 4 months ago
Discriminating Image Senses by Clustering with Multimodal Features
We discuss Image Sense Discrimination (ISD), and apply a method based on spectral clustering, using multimodal features from the image and text of the embedding web page. We evalu...
Nicolas Loeff, Cecilia Ovesdotter Alm, David A. Fo...
117
Voted
VLDB
2000
ACM
133views Database» more  VLDB 2000»
15 years 7 months ago
Memex: A Browsing Assistant for Collaborative Archiving and Mining of Surf Trails
Keyword indices, topic directories, and link-based rankings are used to search and structure the rapidly growing Web today. Surprisingly little use is made of years of browsing ex...
Soumen Chakrabarti, Sandeep Srivastava, Mallela Su...