Sciweavers

» Cleaning Web Pages for Effective Web Content Mining
SIGIR
2005
ACM
15 years 3 months ago
SimFusion: measuring similarity using unified relationship matrix
In this paper we use a Unified Relationship Matrix (URM) to represent a set of heterogeneous data objects (e.g., web pages, queries) and their interrelationships (e.g., hyperlinks...
Wensi Xi, Edward A. Fox, Weiguo Fan, Benyu Zhang, ...
WWW
2008
ACM
15 years 10 months ago
Analysis of geographic queries in a search engine log
Geography is becoming increasingly important in web search. Search engines can often return better results to users by analyzing features such as user location or geographic terms...
Qingqing Gan, Josh Attenberg, Alexander Markowetz, ...
AINA
2005
IEEE
15 years 3 months ago
iHITS: Extending HITS for Personal Interests Profiling
Ever since the boom of the World Wide Web, profiling online users' interests has become an important task for content providers. The traditional approach involves manual entry of...
Ziming Zhuang
ADMA
2006
Springer
15 years 1 months ago
Robust Collective Classification with Contextual Dependency Network Models
In order to exploit the dependencies in relational data to improve predictions, relational classification models often need to make simultaneous statistical judgments abo...
YongHong Tian, Tiejun Huang, Wen Gao
CIKM
2006
ACM
15 years 1 months ago
A probabilistic relevance propagation model for hypertext retrieval
A major challenge in developing models for hypertext retrieval is to effectively combine content information with the link structure available in hypertext collections. Although s...
Azadeh Shakery, ChengXiang Zhai