Search Sciweavers | Sciweavers

874 search results - page 60 / 175

» Jedi: Extracting and Synthesizing Information from the Web

137

click to vote

SIGMOD
2009
ACM

140views Database» more SIGMOD 2009»

Robust web extraction: an approach based on a probabilistic tree-edit model

15 years 8 months ago

Download www-rcf.usc.edu

On script-generated web sites, many documents share common HTML tree structure, allowing wrappers to eﬀectively extract information of interest. Of course, the scripts and thus ...

Nilesh N. Dalvi, Philip Bohannon, Fei Sha

claim paper

Read More »

120

click to vote

CIKM
2008
Springer

155views Information Technology» more CIKM 2008»

Characterizing and predicting community members from evolutionary and heterogeneous networks

15 years 4 months ago

Download www.cais.ntu.edu.sg

Mining different types of communities from web data have attracted a lot of research efforts in recent years. However, none of the existing community mining techniques has taken i...

Qiankun Zhao, Sourav S. Bhowmick, Xin Zheng, Kai Y...

claim paper

Read More »

128

click to vote

EMNLP
2009

143views Natural Language Processing» more EMNLP 2009»

Toward Completeness in Concept Extraction and Classification

14 years 11 months ago

Download www.aclweb.org

Many algorithms extract terms from text together with some kind of taxonomic classification (is-a) link. However, the general approaches used today, and specifically the methods o...

Eduard H. Hovy, Zornitsa Kozareva, Ellen Riloff

claim paper

Read More »

123

click to vote

WWW
2010
ACM

257views Internet Technology» more WWW 2010»

CETR: content extraction via tag ratios

15 years 9 months ago

Download www.cs.illinois.edu

We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...

Tim Weninger, William H. Hsu, Jiawei Han

claim paper

Read More »

113

click to vote

PRIS
2004

100views Pattern Recognition» more PRIS 2004»

Learning Text Extraction Rules, without Ignoring Stop Words

15 years 3 months ago

Download www.di.ubi.pt

Information Extraction (IE) from text /web documents has become an important application area of AI. As the number of web sites and documents has grown dramatically, the users need...

João Cordeiro, Pavel Brazdil

claim paper

Read More »

« Prev « First page 60 / 175 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers