Sciweavers

232 search results - page 26 / 47
» Query-related data extraction of hidden web documents
Sort
View
COMAD
2009
15 years 23 days ago
Querying for relations from the semi-structured Web
We present a class of web queries whose result is a multi-column relation instead of a collection of unstructured documents as in standard web search. The user specifies the query...
Sunita Sarawagi
112
Voted
WWW
2004
ACM
16 years 10 days ago
An efficient and systematic method to generate xslt stylesheets for different wireless pervasive devices
It is a tedious and cumbersome process to update directly a WML document for the wireless Web because its content composes of both data and presentation. Thus, XML is used to hand...
Thomas Kwok, Thao Nguyen, Linh Lam, Kakan Roy
WWW
2010
ACM
15 years 6 months ago
Not so creepy crawler: easy crawler generation with standard xml queries
Web crawlers are increasingly used for focused tasks such as the extraction of data from Wikipedia or the analysis of social networks like last.fm. In these cases, pages are far m...
Franziska von dem Bussche, Klara A. Weiand, Benedi...
81
Voted
WISE
2000
Springer
15 years 4 months ago
Modelling the Webspace of an Intranet
Searching the internet using the currently available searchengines is not satisfactory. Thetechniquesused there focus on the extraction of relevant informationdirectlyfrom the doc...
Roelof van Zwol, Peter M. G. Apers
90
Voted
ICWE
2007
Springer
15 years 5 months ago
Fixing Weakly Annotated Web Data Using Relational Models
In this paper, we present a fast and scalable Bayesian model for improving weakly annotated data – which is typically generated by a (semi) automated information extraction (IE) ...
Fatih Gelgi, Srinivas Vadrevu, Hasan Davulcu