Sciweavers

311 search results - page 43 / 63
» Cleaning Web Pages for Effective Web Content Mining
Sort
View
90
Voted
KDD
2009
ACM
194views Data Mining» more  KDD 2009»
15 years 10 months ago
Combining link and content for community detection: a discriminative approach
In this paper, we consider the problem of combining link and content analysis for community detection from networked data, such as paper citation networks and Word Wide Web. Most ...
Tianbao Yang, Rong Jin, Yun Chi, Shenghuo Zhu
AIRS
2006
Springer
15 years 1 months ago
Learning to Separate Text Content and Style for Classification
Many text documents naturally have two kinds of labels. For example, we may label web pages from universities according to their categories, such as "student" or "fa...
Dell Zhang, Wee Sun Lee
WWW
2007
ACM
15 years 10 months ago
Tag clouds for summarizing web search results
In this paper, we describe an application, PubCloud that uses tag clouds for the summarization of results from queries over the PubMed database of biomedical literature. PubCloud ...
Benjamin M. Good, Byron Yu-Lin Kuo, Mark D. Wilkin...
AIRS
2005
Springer
15 years 3 months ago
Subsite Retrieval: A Novel Concept for Topic Distillation
Topic distillation is one of the main information needs when users search the Web. In previous approaches to topic distillation, the single page was treated as the basic searching ...
Tao Qin, Tie-Yan Liu, Xu-Dong Zhang, Guang Feng, W...
WWW
2004
ACM
15 years 10 months ago
An automatic semantic relationships discovery approach
An important obstacle to the success of the Semantic Web is that the establishment of the semantic relationship is labor-intensive. This paper proposes an automatic semantic relat...
Hai Zhuge, Liping Zheng, Nan Zhang 0007, Xiang Li