Sciweavers

609 search results - page 33 / 122
» Adaptive record extraction from web pages
Sort
View
IPM
2006
146views more  IPM 2006»
15 years 1 months ago
Dictionary-based text categorization of chemical web pages
A new dictionary-based text categorization approach is proposed to classify the chemical web pages efficiently. Using a chemistry dictionary, the approach can extract chemistry-re...
Chunyan Liang, Li Guo, Zhaojie Xia, Feng-Guang Nie...
ER
2007
Springer
142views Database» more  ER 2007»
15 years 8 months ago
Automatic Hidden-Web Table Interpretation by Sibling Page Comparison
The longstanding problem of automatic table interpretation still illudes us. Its solution would not only be an aid to table processing applications such as large volume table conve...
Cui Tao, David W. Embley
DEEC
2006
IEEE
15 years 8 months ago
Optimization of Automatic Navigation to Hidden Web Pages by Ranking-Based Browser Preloading
Web applications have become an invaluable source of information for many different vertical solutions, but their complex navigation and semistructured format make their informatio...
Justo Hidalgo, José Losada, Manuel Á...
WWW
2003
ACM
16 years 2 months ago
DOM-based content extraction of HTML documents
Web pages often contain clutter (such as pop-up ads, unnecessary images and extraneous links) around the body of an article that distracts a user from actual content. Extraction o...
Suhit Gupta, Gail E. Kaiser, David Neistadt, Peter...
WIDM
2003
ACM
15 years 7 months ago
Schema-guided wrapper maintenance for web-data extraction
Extracting data from Web pages using wrappers is a fundamental problem arising in a large variety of applications of vast practical interests. There are two main issues relevant t...
Xiaofeng Meng, Dongdong Hu, Chen Li