Search Sciweavers | Sciweavers

433 search results - page 27 / 87

» Web page title extraction and its application

click to vote

KES
2006
Springer

137views Information Technology» more KES 2006»

Web Site Off-Line Structure Reconfiguration: A Web User Browsing Analysis

14 years 9 months ago

Download wi.dii.uchile.cl

The correct web site text content must be help to the visitors to find what they are looking for. However, the reality is quite different, many times the web page text content is a...

Sebastián A. Ríos, Juan D. Vel&aacut...

claim paper

Read More »

click to vote

WWW
2005
ACM

150views Internet Technology» more WWW 2005»

Extracting context to improve accuracy for HTML content extraction

15 years 10 months ago

Download www1.cs.columbia.edu

Web pages contain clutter (such as ads, unnecessary images and extraneous links) around the body of an article, which distracts a user from actual content. Extraction of "use...

Suhit Gupta, Gail E. Kaiser, Salvatore J. Stolfo

claim paper

Read More »

click to vote

KDD
2005
ACM

194views Data Mining» more KDD 2005»

Web object indexing using domain knowledge

15 years 9 months ago

Download research.microsoft.com

Web object is defined to represent any meaningful object embedded in web pages (e.g. images, music) or pointed to by hyperlinks (e.g. downloadable files). Users usually search for...

Muyuan Wang, Zhiwei Li, Lie Lu, Wei-Ying Ma, Naiya...

claim paper

Read More »

click to vote

PKDD
2007
Springer

120views Data Mining» more PKDD 2007»

Site-Independent Template-Block Detection

15 years 3 months ago

Download research.microsoft.com

Detection of template and noise blocks in web pages is an important step in improving the performance of information retrieval and content extraction. Of the many approaches propos...

Aleksander Kolcz, Wen-tau Yih

claim paper

Read More »

113

click to vote

CIDR
2011

243views Algorithms» more CIDR 2011»

Longitudinal Analytics on Web Archive Data: It's About Time!

14 years 1 months ago

Download cedric.cnam.fr

Organizations like the Internet Archive have been capturing Web contents over decades, building up huge repositories of time-versioned pages. The timestamp annotations and the she...

Gerhard Weikum, Nikos Ntarmos, Marc Spaniol, Peter...

claim paper

Read More »

« Prev « First page 27 / 87 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers