Sciweavers

311 search results - page 27 / 63
» Cleaning Web Pages for Effective Web Content Mining
Sort
View
WSDM
2010
ACM
265views Data Mining» more  WSDM 2010»
15 years 6 months ago
Data-oriented Content Query System: Searching for Data into Text on the Web
As the Web provides rich data embedded in the immense contents inside pages, we witness many ad-hoc efforts for exploiting fine granularity information across Web text, such as We...
Kevin Chen-Chuan Chang, Mianwei Zhou, Tao Cheng
ECAI
2004
Springer
15 years 2 months ago
Finding Social Network for Trust Calculation
Trust is a necessary concept to realize the Semantic Web. But how can we build a “Web of Trust”? We first argue that a small “Web of Trust” for each community is very esse...
Yutaka Matsuo, Hironori Tomobe, Kôiti Hasida...
SIGMOD
2010
ACM
232views Database» more  SIGMOD 2010»
14 years 9 months ago
Optimizing content freshness of relations extracted from the web using keyword search
An increasing number of applications operate on data obtained from the Web. These applications typically maintain local copies of the web data to avoid network latency in data acc...
Mohan Yang, Haixun Wang, Lipyeow Lim, Min Wang
PKDD
2007
Springer
120views Data Mining» more  PKDD 2007»
15 years 3 months ago
Site-Independent Template-Block Detection
Detection of template and noise blocks in web pages is an important step in improving the performance of information retrieval and content extraction. Of the many approaches propos...
Aleksander Kolcz, Wen-tau Yih
KDD
2004
ACM
210views Data Mining» more  KDD 2004»
15 years 10 months ago
Web usage mining based on probabilistic latent semantic analysis
The primary goal of Web usage mining is the discovery of patterns in the navigational behavior of Web users. Standard approaches, such as clustering of user sessions and discoveri...
Xin Jin, Yanzan Zhou, Bamshad Mobasher