Search Sciweavers | Sciweavers

11 search results - page 1 / 3

» Distant IE by Bootstrapping Using Lists and Document Structu...

click to vote

WEBI
2004
Springer

91views Internet Technology» more WEBI 2004»

Semi-Structured Complex List Extraction

13 years 10 months ago

Download www2.cs.uregina.ca

The semi-structured information available in HTML and similar documents provide valuable information that can be used for information extraction applications. This information tog...

Anders Arpteg

claim paper

Read More »

click to vote

EP
1998
Springer

169views Electronic Publishing» more EP 1998»

Measuring Structural Similarity Among Web Documents: Preliminary Results

13 years 9 months ago

Download www.cs.uic.edu

When we describe a Web page informally, we often use phrases like it looks like a newspaper site", there are several unordered lists" or it's just a collection of li...

Isabel F. Cruz, Slava Borisov, Michael A. Marks, T...

claim paper

Read More »

click to vote

KDD
2007
ACM

231views Data Mining» more KDD 2007»

Xproj: a framework for projected structural clustering of xml documents

14 years 5 months ago

Download www.cs.rpi.edu

XML has become a popular method of data representation both on the web and in databases in recent years. One of the reasons for the popularity of XML has been its ability to encod...

Charu C. Aggarwal, Na Ta, Jianyong Wang, Jianhua F...

claim paper

Read More »

click to vote

IPM
2008

141views more IPM 2008»

Towards a unified approach to document similarity search using manifold-ranking of blocks

13 years 4 months ago

Download dblab.mgt.ncu.edu.tw

Document similarity search (i.e. query by example) aims to retrieve a ranked list of documents similar to a query document in a text corpus or on the Web. Most existing approaches...

Xiaojun Wan, Jianwu Yang, Jianguo Xiao

claim paper

Read More »

click to vote

ESWS
2004
Springer

122views Internet Technology» more ESWS 2004»

Learning to Harvest Information for the Semantic Web

13 years 10 months ago

Download eprints.aktors.org

Abstract. In this paper we describe a methodology for harvesting information from large distributed repositories (e.g. large Web sites) with minimum user intervention. The methodol...

Fabio Ciravegna, Sam Chapman, Alexiei Dingli, Yori...

claim paper

Read More »

« Prev « First page 1 / 3 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers