Search Sciweavers | Sciweavers

62 search results - page 7 / 13

» Using web page layout for extraction of sender names

176

click to vote

ICDM
2007
IEEE

149views Data Mining» more ICDM 2007»

Extracting Author Meta-Data from Web Using Visual Features

16 years 17 days ago

Download www.cse.psu.edu

Enriching digital library’s author meta-data can lead to valuable services and applications. This paper addresses the problem of extracting authors’ information from their hom...

Shuyi Zheng, Ding Zhou, Jia Li, C. Lee Giles

claim paper

Read More »

199

click to vote

BMCBI
2011

219views Artificial Intelligence» more BMCBI 2011»

Extracting scientific articles from a large digital archive: BioStor and the Biodiversity Heritage Library

14 years 9 months ago

Download www.biomedcentral.com

Background: The Biodiversity Heritage Library (BHL) is a large digital archive of legacy biological literature, comprising over 31 million pages scanned from books, monographs, an...

Roderic D. M. Page

claim paper

Read More »

178

click to vote

ACL
2006

174views Computational Linguistics» more ACL 2006»

URES : an Unsupervised Web Relation Extraction System

15 years 7 months ago

Download acl.ldc.upenn.edu

Most information extraction systems either use hand written extraction patterns or use a machine learning algorithm that is trained on a manually annotated corpus. Both of these a...

Binyamin Rosenfeld, Ronen Feldman

claim paper

Read More »

160

click to vote

WWW
2008
ACM

118views Internet Technology» more WWW 2008»

Towards a global schema for web entities

16 years 7 months ago

Download www2008.org

Popular entities often have thousands of instances on the Web. In this paper, we focus on the case where they are presented in table-like format, namely appearing with their attri...

Conglei Yao, Yongjian Yu, Sicong Shou, Xiaoming Li

claim paper

Read More »

155

Voted

AAAI
2008

109views Intelligent Agents» more AAAI 2008»

An Unsupervised Approach for Product Record Normalization across Different Web Sites

15 years 8 months ago

Download www.aaai.org

An unsupervised probabilistic learning framework for normalizing product records across different retailer Web sites is presented. Our framework decomposes the problem into two ta...

Tak-Lam Wong, Tik-Shun Wong, Wai Lam

claim paper

Read More »

« Prev « First page 7 / 13 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers