Search Sciweavers | Sciweavers

1014 search results - page 21 / 203

» Using Keyword Extraction for Web Site Clustering

AIRWEB
2007
Springer

Internet Technology

Extracting Link Spam using Biased Random Walks from Spam Seed Sets

16 years 1 month ago

Download airweb.cse.lehigh.edu

Link spam deliberately manipulates hyperlinks between web pages in order to unduly boost the search engine ranking of one or more target pages. Link based ranking algorithms such ...

Baoning Wu, Kumar Chellapilla

ISCIS
2009
Springer

Information Technology

PopulusLog: People information database

16 years 29 min ago

Download filer.case.edu

Information about individuals on publicly available web sites stands as a valuable, yet unorganized, data source. Turning such an enormous data source into a “database” is h...

Ali Cakmak, Mustafa Kirac, Gultekin Özsoyoglu

WWW
2007
ACM

Internet Technology

U-REST: an unsupervised record extraction system

16 years 8 months ago

Download people.csail.mit.edu

In this paper, we describe a system that can extract record structures from web pages with no direct human supervision. Records are commonly occurring HTML-embedded data tuples th...

Yuan Kui Shen, David R. Karger

AAAI
2008

Intelligent Agents

An Unsupervised Approach for Product Record Normalization across Different Web Sites

15 years 9 months ago

Download www.aaai.org

An unsupervised probabilistic learning framework for normalizing product records across different retailer Web sites is presented. Our framework decomposes the problem into two ta...

Tak-Lam Wong, Tik-Shun Wong, Wai Lam

JUCS
2008

Exploring Information Extraction Resilience

15 years 7 months ago

Download www.jucs.org

There are many challenges developers face when attempting to reliably extract data from the Web. One of these challenges is the resilience of the extraction system to changes in ...

Dawn G. Gregg
