Sciweavers

SIGMOD
2003
ACM
190views Database» more  SIGMOD 2003»
13 years 9 months ago
Extracting Structured Data from Web Pages
Many web sites contain large sets of pages generated using a common template or layout. For example, Amazon lays out the author, title, comments, etc. in the same way in all its b...
Arvind Arasu, Hector Garcia-Molina