Sciweavers

146 search results - page 26 / 30
» RoadRunner: Towards Automatic Data Extraction from Large Web...
Sort
View
87
Voted
NAR
2002
132views more  NAR 2002»
14 years 9 months ago
euGenes: a eukaryote genome information system
euGenes is a genome information system and database that provides a common summary of eukaryote genes and genomes, at web site http://iubio.bio.indiana.edu/eugenes/. Seven popular...
Donald G. Gilbert
WEBDB
2005
Springer
102views Database» more  WEBDB 2005»
15 years 3 months ago
Design and Implementation of a Geographic Search Engine
In this paper, we describe the design and initial implementation of a geographic search engine prototype for Germany, based on a large crawl of the de domain. Geographic search en...
Alexander Markowetz, Yen-Yu Chen, Torsten Suel, Xi...
SIGMOD
2004
ACM
142views Database» more  SIGMOD 2004»
15 years 9 months ago
Understanding Web Query Interfaces: Best-Effort Parsing with Hidden Syntax
Recently, the Web has been rapidly "deepened" by many searchable databases online, where data are hidden behind query forms. For modelling and integrating Web databases,...
Zhen Zhang, Bin He, Kevin Chen-Chuan Chang
HT
2004
ACM
15 years 3 months ago
Dynamically growing hypertext collections
Many approaches have been pursued over the years to facilitate creating, organizing, and sharing collections of materials extracted from large information spaces. Little attention...
Pratik Dave, Paul Logasa Bogen II, Unmil Karadkar,...
WSDM
2009
ACM
117views Data Mining» more  WSDM 2009»
15 years 4 months ago
Query by document
We are experiencing an unprecedented increase of content contributed by users in forums such as blogs, social networking sites and microblogging services. Such abundance of conten...
Yin Yang, Nilesh Bansal, Wisam Dakka, Panagiotis G...