Most prior work on information extraction has focused on extracting information from text in digital documents. However, often, the most important information being reported in an...
In this paper, we present Google, a prototype of a large-scale search engine which makes heavy use of the structure present in hypertext. Google is designed to crawl and index the...