Search Sciweavers | Sciweavers

13 search results - page 1 / 3

» xCrawl: A High-Recall Crawling Method for Web Mining

click to vote

ICDM
2008
IEEE

186views Data Mining» more ICDM 2008»

xCrawl: A High-Recall Crawling Method for Web Mining

14 years 12 hour ago

Download ls13-www.cs.uni-dortmund.de

Web Mining Systems exploit the redundancy of data published on the Web to automatically extract information from existing web documents. The ﬁrst step in the Information Extract...

Kostyantyn M. Shchekotykhin, Dietmar Jannach, Gerh...

claim paper

Read More »

click to vote

ADMA
2009
Springer

142views Data Mining» more ADMA 2009»

Crawling Deep Web Using a New Set Covering Algorithm

14 years 5 days ago

Download cs.uwindsor.ca

Abstract. Crawling the deep web often requires the selection of an appropriate set of queries so that they can cover most of the documents in the data source with low cost. This ca...

Yan Wang, Jianguo Lu, Jessica Chen

claim paper

Read More »

click to vote

ECIR
2010
Springer

156views Information Technology» more ECIR 2010»

Mining Anchor Text Trends for Retrieval

13 years 7 months ago

Download www.cse.lehigh.edu

Anchor text has been considered as a useful resource to complement the representation of target pages and is broadly used in web search. However, previous research only uses anchor...

Na Dai, Brian D. Davison

claim paper

Read More »

click to vote

CORR
2011
Springer

326views Education» more CORR 2011»

Mining User Comment Activity for Detecting Forum Spammers in YouTube

13 years 2 days ago

Download ir.ii.uam.es

Research shows that comment spamming (comments which are unsolicited, unrelated, abusive, hateful, commercial advertisements etc) in online discussion forums has become a common p...

Ashish Sureka

claim paper

Read More »

click to vote

EMNLP
2011

164views Natural Language Processing» more EMNLP 2011»

Watermarking the Outputs of Structured Prediction with an application in Statistical Machine Translation

12 years 5 months ago

Download cs.jhu.edu

We propose a general method to watermark and probabilistically identify the structured outputs of machine learning algorithms. Our method is robust to local editing operations and...

Ashish Venugopal, Jakob Uszkoreit, David Talbot, F...

claim paper

Read More »

« Prev « First page 1 / 3 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers