Sciweavers

Search results: Adaptive record extraction from web pages
ICWE
2007
Springer
15 years 8 months ago
Fixing Weakly Annotated Web Data Using Relational Models
In this paper, we present a fast and scalable Bayesian model for improving weakly annotated data – which is typically generated by a (semi) automated information extraction (IE) ...
Fatih Gelgi, Srinivas Vadrevu, Hasan Davulcu
ICCV
2005
IEEE
15 years 7 months ago
Learning Non-Generative Grammatical Models for Document Analysis
We present a general approach for the hierarchical segmentation and labeling of document layout structures. This approach models document layout as a grammar and performs a glo...
Michael Shilman, Percy Liang, Paul A. Viola
SIGIR
2005
ACM
15 years 7 months ago
Personalizing search via automated analysis of interests and activities
We formulate and study search algorithms that consider a user’s prior interactions with a wide variety of content to personalize that user’s current Web search. Rather than re...
Jaime Teevan, Susan T. Dumais, Eric Horvitz
ICDE
2003
IEEE
15 years 7 months ago
DBProxy: A dynamic data cache for Web applications
The majority of web pages served today are generated dynamically, usually by an application server querying a back-end database. To enhance the scalability of dynamic content serv...
Khalil Amiri, Sanghyun Park, Renu Tewari, Sriram P...
SIGMOD
2002
ACM
16 years 2 months ago
APEX: an adaptive path index for XML data
The emergence of the Web has increased interests in XML data. XML query languages such as XQuery and XPath use label paths to traverse the irregularly structured data. Without a s...
Chin-Wan Chung, Jun-Ki Min, Kyuseok Shim