Sciweavers

148 search results - page 16 / 30
» Landmark Extraction: A Web Mining Approach
Sort
View
PKDD
2007
Springer
120views Data Mining» more  PKDD 2007»
15 years 8 months ago
Site-Independent Template-Block Detection
Detection of template and noise blocks in web pages is an important step in improving the performance of information retrieval and content extraction. Of the many approaches propos...
Aleksander Kolcz, Wen-tau Yih
KDD
2005
ACM
182views Data Mining» more  KDD 2005»
16 years 2 months ago
Making holistic schema matching robust: an ensemble approach
The Web has been rapidly "deepened" by myriad searchable databases online, where data are hidden behind query interfaces. As an essential task toward integrating these m...
Bin He, Kevin Chen-Chuan Chang
ADMA
2006
Springer
150views Data Mining» more  ADMA 2006»
15 years 7 months ago
Web Scale Competitor Discovery Using Mutual Information
Abstract. The web with its rapid expansion has become an excellent resource for gathering information and people’s opinion. A company owner wants to know who is the competitor, a...
Rui Li, Shenghua Bao, Jin Wang, Yuanjie Liu, Yong ...
CSE
2009
IEEE
15 years 8 months ago
Web Science 2.0: Identifying Trends through Semantic Social Network Analysis
—We introduce a novel set of social network analysis based algorithms for mining the Web, blogs, and online forums to identify trends and find the people launching these new tren...
Peter A. Gloor, Jonas Krauss, Stefan Nann, Kai Fis...
SAC
2004
ACM
15 years 7 months ago
Classifying biological articles using web resources
Text classification systems on biomedical literature aim to select relevant articles to a specific issue from large corpora. Most systems with an acceptable accuracy are based o...
Francisco M. Couto, Bruno Martins, Mário J....