Search Sciweavers | Sciweavers

820 search results - page 59 / 164

» Deep web data extraction

116

click to vote

WWW
2009
ACM

106views Internet Technology» more WWW 2009»

News article extraction with template-independent wrapper

15 years 10 months ago

Download www.cs.sfu.ca

We consider the problem of template-independent news extraction. The state-of-the-art news extraction method is based on template-level wrapper induction, which has two serious li...

Junfeng Wang, Xiaofei He, Can Wang, Jian Pei, Jiaj...

claim paper

Read More »

157

click to vote

JCIT
2010

149views more JCIT 2010»

People Summarization by Combining Named Entity Recognition and Relation Extraction

14 years 10 months ago

Download www.aicit.org

The two most important tasks in entity information summarization from the Web are named entity recognition and relation extraction. Little work has been done toward an integrated ...

Xiaojiang Liu, Nenghai Yu

claim paper

Read More »

click to vote

LREC
2010

216views Education» more LREC 2010»

BlogBuster: A Tool for Extracting Corpora from the Blogosphere

15 years 4 months ago

Download www.lrec-conf.org

This paper presents BlogBuster, a tool for extracting a corpus from the blogosphere. The topic of cleaning arbitrary web pages with the goal of extracting a corpus from web data, ...

Georgios Petasis, Dimitrios Petasis

claim paper

Read More »

132

click to vote

KDD
2003
ACM

148views Data Mining» more KDD 2003»

Mining data records in Web pages

16 years 3 months ago

Download www.cs.uic.edu

A large amount of information on the Web is contained in regularly structured objects, which we call data records. Such data records are important because they often present the e...

Bing Liu, Robert L. Grossman, Yanhong Zhai

claim paper

Read More »

252

click to vote

SIGMOD
2000
ACM

236views Database» more SIGMOD 2000»

XTRACT: A System for Extracting Document Type Descriptors from XML Documents

15 years 7 months ago

Download www.softnet.tuc.gr

XML is rapidly emerging as the new standard for data representation and exchange on the Web. An XML document can be accompanied by a Document Type Descriptor (DTD) which plays the...

Minos N. Garofalakis, Aristides Gionis, Rajeev Ras...

claim paper

Read More »

« Prev « First page 59 / 164 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers