Sciweavers

» Deep web data extraction
KCAP
2005
ACM
15 years 3 months ago
Collecting paraphrase corpora from volunteer contributors
Extensive and deep paraphrase corpora are important for a variety of natural language processing and user interaction tasks. In this paper, we present an approach which i) collect...
Timothy Chklovski
EDBT
2009
ACM
15 years 4 months ago
High-performance information extraction with AliBaba
A wealth of information is available only in web pages, patents, publications etc. Extracting information from such sources is challenging, both due to the typically complex langu...
Peter Palaga, Long Nguyen, Ulf Leser, Jörg Ha...
CN
2006
14 years 10 months ago
A framework for mining evolving trends in Web data streams using dynamic learning and retrospective validation
The expanding and dynamic nature of the Web poses enormous challenges to most data mining techniques that try to extract patterns from Web data, such as Web usage and Web content....
Olfa Nasraoui, Carlos Rojas, Cesar Cardona
IRI
2009
IEEE
15 years 4 months ago
Ontology Guided Autonomous Label Assignment in Wrapper Induced Tables with Missing Column Names
Formulating and executing queries over distributed, autonomous and heterogeneous resources is an important research area. The advent of the Internet and the Web and their inherent...
Mohammad Shafkat Amin, Hasan M. Jamil
ACMICEC
2004
ACM
15 years 3 months ago
Efficient integration of web services with distributed data flow and active mediation
This paper presents a loosely coupled service-composition paradigm. This paradigm employs a distributed data flow that differs markedly from centralized information flow adopted b...
David Liu, Jun Peng, Kincho H. Law, Gio Wiederhold