Search Sciweavers | Sciweavers

408 search results - page 30 / 82

» Experiments with Geographic Evidence Extracted from Document...

click to vote

ER
2007
Springer

99views Database» more ER 2007»

VERT: A Semantic Approach for Content Search and Content Extraction in XML Query Processing

15 years 6 months ago

Download www.comp.nus.edu.sg

Processing a twig pattern query in XML document includes structural search and content search. Most existing algorithms only focus on structural search. They treat content nodes th...

Huayu Wu, Tok Wang Ling, Bo Chen

claim paper

Read More »

Voted

WWW
2009
ACM

122views Internet Technology» more WWW 2009»

SOFIE: a self-organizing framework for information extraction

16 years 1 months ago

Download www2009.org

This paper presents SOFIE, a system for automated ontology extension. SOFIE can parse natural language documents, extract ontological facts from them and link the facts into an on...

Fabian M. Suchanek, Mauro Sozio, Gerhard Weikum

claim paper

Read More »

103

Voted

INFOSCALE
2007
ACM

104views Information Technology» more INFOSCALE 2007»

Query-driven indexing for scalable peer-to-peer text retrieval

15 years 2 months ago

Download lsirpeople.epfl.ch

We present a query-driven algorithm for the distributed indexing of large document collections within structured P2P networks. To cope with bandwidth consumption that has been ide...

Gleb Skobeltsyn, Toan Luu, Ivana Podnar Zarko, Mar...

claim paper

Read More »

Voted

EWMF
2005
Springer

149views Internet Technology» more EWMF 2005»

Discovering a Term Taxonomy from Term Similarities Using Principal Component Analysis

15 years 6 months ago

Download lahuen.dcc.uchile.cl

Abstract. We show that eigenvector decomposition can be used to extract a term taxonomy from a given collection of text documents. So far, methods based on eigenvector decompositio...

Holger Bast, Georges Dupret, Debapriyo Majumdar, B...

claim paper

Read More »

104

Voted

WWW
2005
ACM

150views Internet Technology» more WWW 2005»

Extracting context to improve accuracy for HTML content extraction

16 years 1 months ago

Download www1.cs.columbia.edu

Web pages contain clutter (such as ads, unnecessary images and extraneous links) around the body of an article, which distracts a user from actual content. Extraction of "use...

Suhit Gupta, Gail E. Kaiser, Salvatore J. Stolfo

claim paper

Read More »

« Prev « First page 30 / 82 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers