Search Sciweavers | Sciweavers

148 search results - page 23 / 30

» Landmark Extraction: A Web Mining Approach

236

click to vote

JCDL
2004
ACM

198views Education» more JCDL 2004»

Finding authoritative people from the web

16 years 1 months ago

Download www.ingrid.org

Today’s web is so huge and diverse that it arguably reﬂects the real world. For this reason, searching the web is a promising approach to ﬁnd things in the real world. This ...

Masanori Harada, Shin-ya Sato, Kazuhiro Kazama

claim paper

Read More »

212

click to vote

WSDM
2010
ACM

204views Data Mining» more WSDM 2010»

Learning URL patterns for webpage de-duplication

16 years 2 months ago

Download www.wsdm-conference.org

Presence of duplicate documents in the World Wide Web adversely aﬀects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...

Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...

claim paper

Read More »

187

click to vote

SAC
2005
ACM

124views Applied Computing» more SAC 2005»

Automatic wrapper maintenance for semi-structured web sources using results from previous queries

16 years 1 months ago

Download www.tic.udc.es

During the last years, significant attention has been paid to the problem of building wrappers for extracting data from semistructured web sources. Nevertheless, since web sources...

Juan Raposo, Alberto Pan, Manuel Álvarez, &...

claim paper

Read More »

198

click to vote

EMNLP
2007

118views Natural Language Processing» more EMNLP 2007»

Learning to Find English to Chinese Transliterations on the Web

15 years 9 months ago

Download www.aclweb.org

We present a method for learning to find English to Chinese transliterations on the Web. In our approach, proper nouns are expanded into new queries aimed at maximizing the probab...

Jian-Cheng Wu, Jason S. Chang

claim paper

Read More »

213

click to vote

WWW
2005
ACM

99views Internet Technology» more WWW 2005»

The volume and evolution of web page templates

16 years 8 months ago

Download research.yahoo.com

Web pages contain a combination of unique content and template material, which is present across multiple pages and used primarily for formatting, navigation, and branding. We stu...

David Gibson, Kunal Punera, Andrew Tomkins

claim paper

Read More »

« Prev « First page 23 / 30 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers