Sciweavers

IEICET
2006
14 years 9 months ago
Extraction of Semantic Text Portion Related to Anchor Link
The semantic text portion (STP) has recently become popular in the field of Web mining. An STP is a text portion in the original page that is semantically related to the anchor pointing...
Bui Quang Hung, Masanori Otsubo, Yoshinori Hijikat...
WWW
2010
ACM
15 years 4 months ago
Sampling high-quality clicks from noisy click data
Click data captures many users’ document preferences for a query and has been shown to help significantly improve search engine ranking. However, most click data is noisy and of...
Adish Singla, Ryen W. White
WWW
2008
ACM
15 years 10 months ago
Web graph similarity for anomaly detection (poster)
Web graphs are approximate snapshots of the web, created by search engines. Their creation is an error-prone procedure that relies on the availability of Internet nodes and the fa...
Panagiotis Papadimitriou 0002, Ali Dasdan, Hector ...
WWW
2005
ACM
15 years 10 months ago
Automatically learning document taxonomies for hierarchical classification
While several hierarchical classification methods have been applied to web content, such techniques invariably rely on a pre-defined taxonomy of documents. We propose a new techni...
Kunal Punera, Suju Rajan, Joydeep Ghosh
WWW
2009
ACM
15 years 10 months ago
Mining multilingual topics from Wikipedia
In this paper, we try to leverage a large-scale and multilingual knowledge base, Wikipedia, to help effectively analyze and organize Web information written in different languages...
Xiaochuan Ni, Jian-Tao Sun, Jian Hu, Zheng Chen