Sciweavers

» Experiments with Geographic Evidence Extracted from Document...
ER
2007
Springer
15 years 6 months ago
VERT: A Semantic Approach for Content Search and Content Extraction in XML Query Processing
Processing a twig pattern query in an XML document includes structural search and content search. Most existing algorithms only focus on structural search. They treat content nodes th...
Huayu Wu, Tok Wang Ling, Bo Chen
WWW
2009
ACM
16 years 1 month ago
SOFIE: a self-organizing framework for information extraction
This paper presents SOFIE, a system for automated ontology extension. SOFIE can parse natural language documents, extract ontological facts from them and link the facts into an on...
Fabian M. Suchanek, Mauro Sozio, Gerhard Weikum
INFOSCALE
2007
ACM
15 years 2 months ago
Query-driven indexing for scalable peer-to-peer text retrieval
We present a query-driven algorithm for the distributed indexing of large document collections within structured P2P networks. To cope with bandwidth consumption that has been ide...
Gleb Skobeltsyn, Toan Luu, Ivana Podnar Zarko, Mar...
EWMF
2005
Springer
15 years 6 months ago
Discovering a Term Taxonomy from Term Similarities Using Principal Component Analysis
Abstract. We show that eigenvector decomposition can be used to extract a term taxonomy from a given collection of text documents. So far, methods based on eigenvector decompositio...
Holger Bast, Georges Dupret, Debapriyo Majumdar, B...
WWW
2005
ACM
16 years 1 month ago
Extracting context to improve accuracy for HTML content extraction
Web pages contain clutter (such as ads, unnecessary images and extraneous links) around the body of an article, which distracts a user from actual content. Extraction of "use...
Suhit Gupta, Gail E. Kaiser, Salvatore J. Stolfo