Sciweavers

391 search results - page 3 / 79
» Automatically Extracting Structure and Data from Business Re...
Sort
View
SIGMOD
2003
ACM
190views Database» more  SIGMOD 2003»
13 years 11 months ago
Extracting Structured Data from Web Pages
Many web sites contain large sets of pages generated using a common template or layout. For example, Amazon lays out the author, title, comments, etc. in the same way in all its b...
Arvind Arasu, Hector Garcia-Molina
WWW
2010
ACM
13 years 10 months ago
Web-scale knowledge extraction from semi-structured tables
A wealth of knowledge is encoded in the form of tables on the World Wide Web. We propose a classification algorithm and a rich feature set for automatically recognizing layout tab...
Eric Crestan, Patrick Pantel
JCDL
2006
ACM
237views Education» more  JCDL 2006»
13 years 11 months ago
Automatic extraction of table metadata from digital documents
Tables are used to present, list, summarize, and structure important data in documents. In scholarly articles, they are often used to present the relationships among data and high...
Ying Liu, Prasenjit Mitra, C. Lee Giles, Kun Bai
ICADL
2007
Springer
129views Education» more  ICADL 2007»
13 years 12 months ago
Using Automatic Metadata Extraction to Build a Structured Syllabus Repository
Syllabi are important documents created by instructors for students. Students use syllabi to find information and to prepare for class. Instructors often need to find similar syl...
Xiaoyan Yu, Manas Tungare, Weiguo Fan, Manuel A. P...
WWW
2009
ACM
14 years 6 months ago
Incorporating site-level knowledge to extract structured data from web forums
Web forums have become an important data resource for many web applications, but extracting structured data from unstructured web forum pages is still a challenging task due to bo...
Jiang-Ming Yang, Rui Cai, Yida Wang, Jun Zhu, Lei ...