Search Sciweavers | Sciweavers

2677 search results - page 394 / 536

» Extracting Structured Data from Web Pages

129

click to vote

TREC
2007

147views Information Technology» more TREC 2007»

DUTIR at TREC 2007 Blog Track

15 years 4 months ago

Download trec.nist.gov

This paper describes DUTIR at TREC 2007 Blog Track. In data preprocessing, a non English language list created from the corpus was used to remove the non English blogs, blog templ...

Rui Song, Qin Tang, Daming Shi 0002, Hongfei Lin, ...

claim paper

Read More »

138

click to vote

ESWS
2010
Springer

279views Internet Technology» more ESWS 2010»

LESS - Template-Based Syndication and Presentation of Linked Data

15 years 8 months ago

Download www.informatik.uni-leipzig.de

Recently, the publishing of structured, semantic information as linked data has gained quite some momentum. For ordinary users on the Internet, however, this information is not yet...

Sören Auer, Raphael Doehring, Sebastian Dietz...

claim paper

Read More »

147

click to vote

PLDI
2010
ACM

361views Programming Languages» more PLDI 2010»

A Context-free Markup Language for Semi-structured Text

16 years 17 days ago

Download www.cs.princeton.edu

An ad hoc data format is any non-standard, semi-structured data format for which robust data processing tools are not available. In this paper, we present ANNE, a new kind of mark...

Qian Xi, David Walker

claim paper

Read More »

134

click to vote

WWW
2008
ACM

214views Internet Technology» more WWW 2008»

16 years 3 months ago

Efficient similarity joins for near duplicate detection

Download www2008.org

With the increasing amount of data and the need to integrate data from multiple data sources, a challenging issue is to find near duplicate records efficiently. In this paper, we ...

Chuan Xiao, Wei Wang 0011, Xuemin Lin, Jeffrey Xu ...

claim paper

Read More »

137

click to vote

KDD
2009
ACM

156views Data Mining» more KDD 2009»

Query result clustering for object-level search

16 years 3 months ago

Download research.microsoft.com

Query result clustering has recently attracted a lot of attention to provide users with a succinct overview of relevant results. However, little work has been done on organizing t...

Jongwuk Lee, Seung-won Hwang, Zaiqing Nie, Ji-Rong...

claim paper

Read More »

« Prev « First page 394 / 536 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers