Search Sciweavers | Sciweavers

252 search results - page 20 / 51

» Mining a Web Citation Database for Document Clustering

197

Voted

ICDE
2004
IEEE

117views Database» more ICDE 2004»

Probe, Cluster, and Discover: Focused Extraction of QA-Pagelets from the Deep Web

16 years 3 months ago

Download www.cc.gatech.edu

In this paper, we introduce the concept of a QA-Pagelet to refer to the content region in a dynamic page that contains query matches. We present THOR, a scalable and efficient min...

James Caverlee, Ling Liu, David Buttler

claim paper

Read More »

click to vote

DEXAW
2008
IEEE

123views Database» more DEXAW 2008»

Text Extraction from the Web via Text-to-Tag Ratio

15 years 8 months ago

Download www.uni-weimar.de

– We describe a method to extract content text from diverse Web pages by using the HTML document’s Text-to-Tag Ratio rather than specific HTML cues that may not be constant acr...

Tim Weninger, William H. Hsu

claim paper

Read More »

Voted

KDD
1998
ACM

80views Data Mining» more KDD 1998»

Human Performance on Clustering Web Pages: A Preliminary Study

15 years 6 months ago

Download www.research.rutgers.edu

With the increase in information on the World Wide Web it has become difficult to quickly find desired information without using multiple queries or using a topic-specific search ...

Sofus A. Macskassy, Arunava Banerjee, Brian D. Dav...

claim paper

Read More »

123

Voted

CIKM
2006
Springer

148views Information Technology» more CIKM 2006»

Multi-evidence, multi-criteria, lazy associative document classification

15 years 5 months ago

Download www.cs.rpi.edu

We present a novel approach for classifying documents that combines different pieces of evidence (e.g., textual features of documents, links, and citations) transparently, through...

Adriano Veloso, Wagner Meira Jr., Marco Cristo, Ma...

claim paper

Read More »

105

click to vote

KDD
2002
ACM

138views Data Mining» more KDD 2002»

Learning to match and cluster large high-dimensional data sets for data integration

16 years 2 months ago

Download www.cs.cmu.edu

Part of the process of data integration is determining which sets of identifiers refer to the same real-world entities. In integrating databases found on the Web or obtained by us...

William W. Cohen, Jacob Richman

claim paper

Read More »

« Prev « First page 20 / 51 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers