Sciweavers

81 search results - page 2 / 17
» Estimating web site readability using content extraction
Sort
View
ICDM
2003
IEEE
225views Data Mining» more  ICDM 2003»
13 years 11 months ago
Combining the web content and usage mining to understand the visitor behavior in a web site
A web site is a semi structured collection of different kinds of data, whose motivation is show relevant information to visitor and by this way capture her/his attention. Understa...
Juan D. Velásquez, Hiroshi Yasuda, Terumasa...
WSDM
2009
ACM
161views Data Mining» more  WSDM 2009»
14 years 1 months ago
Predicting the readability of short web summaries
Readability is a crucial presentation attribute that web summarization algorithms consider while generating a querybaised web summary. Readability quality also forms an important ...
Tapas Kanungo, David Orr
WWW
2004
ACM
14 years 7 months ago
Characterization of a large web site population with implications for content delivery
This paper presents a systematic study of the properties of a large number of Web sites hosted by a major ISP. To our knowledge, ours is the first comprehensive study of a large s...
Leeann Bent, Michael Rabinovich, Geoffrey M. Voelk...
WWW
2003
ACM
14 years 7 months ago
DOM-based content extraction of HTML documents
Web pages often contain clutter (such as pop-up ads, unnecessary images and extraneous links) around the body of an article that distracts a user from actual content. Extraction o...
Suhit Gupta, Gail E. Kaiser, David Neistadt, Peter...
WWW
2010
ACM
14 years 1 months ago
Automatic extraction of clickable structured web contents for name entity queries
Today the major web search engines answer queries by showing ten result snippets, which need to be inspected by users for identifying relevant results. In this paper we investigat...
Xiaoxin Yin, Wenzhao Tan, Xiao Li, Yi-Chin Tu