Search Sciweavers | Sciweavers

1437 search results - page 102 / 288

» Content Extraction Signatures

130

click to vote

WSDM
2012
ACM

214views Data Mining» more WSDM 2012»

Selecting actions for resource-bounded information extraction using reinforcement learning

13 years 10 months ago

Download people.cs.umass.edu

Given a database with missing or uncertain content, our goal is to correct and ﬁll the database by extracting speciﬁc information from a large corpus such as the Web, and to d...

Pallika H. Kanani, Andrew K. McCallum

claim paper

Read More »

115

click to vote

ICMCS
2007
IEEE

203views Multimedia» more ICMCS 2007»

Attacking Some Perceptual Image Hash Algorithms

15 years 9 months ago

Download www.cosic.esat.kuleuven.be

Perceptual hashing is an emerging solution for multimedia content authentication. Due to their robustness, such techniques might not work well when malicious attack is perceptuall...

Li Weng, Bart Preneel

claim paper

Read More »

134

click to vote

ICDAR
2009
IEEE

168views Document Analysis» more ICDAR 2009»

Scalable Feature Extraction from Noisy Documents

15 years 10 months ago

Download www.cvc.uab.es

We cope with the metadata recognition in layoutoriented documents. We address the problem as a classiﬁcation task and propose a method for automatic extraction of relevant featu...

Loïc Lecerf, Boris Chidlovskii

claim paper

Read More »

117

click to vote

CIKM
2007
Springer

199views Information Technology» more CIKM 2007»

Comments-oriented blog summarization by sentence extraction

15 years 9 months ago

Download www.cais.ntu.edu.sg

Much existing research on blogs focused on posts only, ignoring their comments. Our user study conducted on summarizing blog posts, however, showed that reading comments does chan...

Meishan Hu, Aixin Sun, Ee-Peng Lim

claim paper

Read More »

119

click to vote

SOFSEM
2007
Springer

156views Theoretical Computer Science» more SOFSEM 2007»

Creating Permanent Test Collections of Web Pages for Information Extraction Research

15 years 9 months ago

Download www.dbai.tuwien.ac.at

In the research area of automatic web information extraction, there is a need for permanent and annotated web page collections enabling objective performance evaluation of differen...

Bernhard Pollak, Wolfgang Gatterbauer

claim paper

Read More »

« Prev « First page 102 / 288 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers