Sciweavers

145 search results - page 18 / 29
» Web Contents Tracking by Learning of Page Grammars
Sort
View
ICDE
2008
IEEE
143views Database» more  ICDE 2008»
15 years 10 months ago
Efficient Discovery of Authoritative Resources
Abstract- Given a dynamic corpus whose content and attention are changing on a daily basis, is it possible to collect and maintain the high-quality resources with a minimal investm...
Ravi Kumar, Kevin Lang, Cameron Marlow, Andrew Tom...
ATAL
2004
Springer
15 years 2 months ago
QueryTracker: An Agent for Tracking Persistent Information Needs
Most people have long term information interests. Current Web search engines satisfy immediate information needs. Specific sites support tracking of long term interests. We prese...
Gabriel Somlo, Adele E. Howe
CIDR
2011
243views Algorithms» more  CIDR 2011»
14 years 1 months ago
Longitudinal Analytics on Web Archive Data: It's About Time!
Organizations like the Internet Archive have been capturing Web contents over decades, building up huge repositories of time-versioned pages. The timestamp annotations and the she...
Gerhard Weikum, Nikos Ntarmos, Marc Spaniol, Peter...
WWW
2009
ACM
15 years 10 months ago
Extracting article text from the web with maximum subsequence segmentation
Much of the information on the Web is found in articles from online news outlets, magazines, encyclopedias, review collections, and other sources. However, extracting this content...
Jeff Pasternack, Dan Roth
IWANN
1999
Springer
15 years 1 months ago
Applying Ontology to the Web: A Case Study
This paper describes the use of Simple HTML Ontology Extensions (SHOE) in a real world internet application. SHOE allows authors to add semantic content to web pages and to relate...
Jeff Heflin, James A. Hendler, Sean Luke