Sciweavers

WEBDB
1998
Springer

Extracting Patterns and Relations from the World Wide Web

13 years 10 months ago
Extracting Patterns and Relations from the World Wide Web
The World Wide Web is a vast resource for information. At the same time it is extremely distributed. A particular type of data such as restaurant lists maybe scattered across thousands of independent information sources in many di erent formats. In this paper, we consider the problem of extracting a relation for such a data type from all of these sources automatically. We present a technique which exploits the duality between sets of patterns and relations to grow the target relation starting from a small sample. To test our technique we use it to extract a relation of (author,title) pairs from the World Wide Web.
Sergey Brin
Added 06 Aug 2010
Updated 06 Aug 2010
Type Conference
Year 1998
Where WEBDB
Authors Sergey Brin
Comments (0)