Sciweavers

CIKM
2008
Springer
13 years 6 months ago
Characterizing and predicting community members from evolutionary and heterogeneous networks
Mining different types of communities from web data have attracted a lot of research efforts in recent years. However, none of the existing community mining techniques has taken i...
Qiankun Zhao, Sourav S. Bhowmick, Xin Zheng, Kai Y...
VLDB
1999
ACM
134views Database» more  VLDB 1999»
13 years 8 months ago
Capturing and Querying Multiple Aspects of Semistructured Data
Motivated to a large extent by the substantial and growing prominence of the World-Wide Web and the potential benefits that may be obtained by applying database concepts and tech...
Curtis E. Dyreson, Michael H. Böhlen, Christi...
WIDM
1999
ACM
13 years 8 months ago
Automatic Migration of Files into Relational Databases
In order to provide database-like features for files, particularly for searching in Web data, one solution is to migrate file data into a relational database. Having stored the da...
Uwe Hohenstein, Andreas Ebert
WISE
2002
Springer
13 years 9 months ago
Querying Web Data - The WebQA Approach
The common paradigm of searching and retrieving information on the Web is based on keyword-based search using one or more search engines, and then browsing through the large numbe...
Sunny K. S. Lam, M. Tamer Özsu
DBPL
2003
Springer
109views Database» more  DBPL 2003»
13 years 9 months ago
Modelling Dynamic Web Data
We introduce the Xdπ calculus, a peer-to-peer model for reasoning about dynamic web data. Web data is not just stored statically. Rather it is referenced indirectly, for example ...
Philippa Gardner, Sergio Maffeis
WIDM
2003
ACM
13 years 9 months ago
Datarover: a taxonomy based crawler for automated data extraction from data-intensive websites
The advent of e-commerce has created a trend that brought thousands of catalogs online. Most of these websites are “taxonomy-directed”. A Web site is said to be ``taxonomydire...
Hasan Davulcu, S. Koduri, Saravanakumar Nagarajan
IDEAS
2005
IEEE
142views Database» more  IDEAS 2005»
13 years 10 months ago
Automatically Maintaining Wrappers for Web Sources
A substantial subset of the web data follows some kind of underlying structure. Nevertheless, HTML does not contain any schema or semantic information about the data it represents...
Juan Raposo, Alberto Pan, Manuel Álvarez, J...
DEEC
2006
IEEE
13 years 10 months ago
Maintaining Web Navigation Flows for Wrappers
A substantial subset of the web data follows some kind of underlying structure. In order to let software programs gain full benefit from these “semistructured” web sources, wra...
Juan Raposo, Manuel Álvarez, José Lo...
IPPS
2008
IEEE
13 years 11 months ago
Multi-threaded data mining of EDGAR CIKs (Central Index Keys) from ticker symbols
This paper describes how use the Java Swing HTMLEditorKit to perform multi-threaded web data mining on the EDGAR system (Electronic DataGathering, Analysis, and Retrieval system)....
Dougal A. Lyon
ICIW
2008
IEEE
13 years 11 months ago
Web Contents Tracking by Learning of Page Grammars
A significant fraction of Web data is available only for short periods of time. We consider methods to keep track and to record such dynamic information automatically. The main p...
Dirk Kukulenz, Christoph Reinke, Nils Hoeller