Search Sciweavers | Sciweavers

820 search results - page 52 / 164

» Deep web data extraction

140

click to vote

BTW
2003
Springer

140views Database» more BTW 2003»

An Ontology for Domain-oriented Semantic Similarity Search on XML Data

15 years 8 months ago

Download www.mpi-inf.mpg.de

Abstract: Query languages for XML such as XPath or XQuery support Boolean retrieval where a query result is a (possibly restructured) subset of XML elements or entire documents tha...

Anja Theobald

claim paper

Read More »

137

Voted

WWW
2010
ACM

257views Internet Technology» more WWW 2010»

CETR: content extraction via tag ratios

15 years 10 months ago

Download www.cs.illinois.edu

We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...

Tim Weninger, William H. Hsu, Jiawei Han

claim paper

Read More »

128

click to vote

AUSAI
2003
Springer

81views Artificial Intelligence» more AUSAI 2003»

Information Extraction via Path Merging

15 years 8 months ago

Download www.ict.csiro.au

Abstract. In this paper, we describe a new approach to information extraction that neatly integrates top-down hypothesis driven information with bottom-up data driven information. ...

Robert Dale, Cécile Paris, Marc Tilbrook

claim paper

Read More »

139

Voted

VLDB
2001
ACM

109views Database» more VLDB 2001»

Mining Multi-Dimensional Constrained Gradients in Data Cubes

15 years 7 months ago

Download www.cs.sfu.ca

Constrained gradient analysis (similar to the “cubegrade” problem posed by Imielinski, et al. [9]) is to extract pairs of similar cell characteristics associated with big chan...

Guozhu Dong, Jiawei Han, Joyce M. W. Lam, Jian Pei...

claim paper

Read More »

120

click to vote

PAKDD
2010
ACM

167views Data Mining» more PAKDD 2010»

Resource-Bounded Information Extraction: Acquiring Missing Feature Values on Demand

15 years 6 months ago

Download www.cs.umass.edu

We present a general framework for the task of extracting speciﬁc information “on demand” from a large corpus such as the Web under resource-constraints. Given a database wit...

Pallika Kanani, Andrew McCallum, Shaohan Hu

claim paper

Read More »

« Prev « First page 52 / 164 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers