Sciweavers

120 search results - page 3 / 24
» Extracting Instances of Relations from Web Documents Using R...
Sort
View
WWW
2006
ACM
14 years 6 months ago
Using graph matching techniques to wrap data from PDF documents
Wrapping is the process of navigating a data source, semiautomatically extracting data and transforming it into a form suitable for data processing applications. There are current...
Tamir Hassan, Robert Baumgartner
WWW
2007
ACM
14 years 6 months ago
Extraction and search of chemical formulae in text documents on the web
Often scientists seek to search for articles on the Web related to a particular chemical. When a scientist searches for a chemical formula using a search engine today, she gets ar...
Bingjun Sun, Qingzhao Tan, Prasenjit Mitra, C. Lee...
WEBDB
2010
Springer
156views Database» more  WEBDB 2010»
13 years 10 months ago
Redundancy-Driven Web Data Extraction and Integration
A large number of web sites publish pages containing structured information about recognizable concepts, but these data are only partially used by current applications. Although s...
Paolo Papotti, Valter Crescenzi, Paolo Merialdo, M...
ECWEB
2005
Springer
127views ECommerce» more  ECWEB 2005»
13 years 10 months ago
Knowledge Discovery in Web-Directories: Finding Term-Relations to Build a Business Ontology
The Web continues to grow at a tremendous rate. Search engines find it increasingly difficult to provide useful results. To manage this explosively large number of Web documents,...
Sandip Debnath, Tracy Mullen, Arun Upneja, C. Lee ...
KDD
2002
ACM
148views Data Mining» more  KDD 2002»
14 years 5 months ago
Discovering informative content blocks from Web documents
In this paper, we propose a new approach to discover informative contents from a set of tabular documents (or Web pages) of a Web site. Our system, InfoDiscoverer, first partition...
Shian-Hua Lin, Jan-Ming Ho