Sciweavers

311 search results - page 6 / 63
» Cleaning Web Pages for Effective Web Content Mining
INAP
2001
Springer
15 years 1 month ago
A Modern Approach to Searching the World Wide Web: Ranking Pages by Inference over Content
The Hypertext-based Webs such as Intranets contain a vast amount of information pertaining to an enormous number of subjects. It is, however, an organically grown and thus essentia...
Bronson Trevor, Edgar Weippl, Werner Winiwarter
SMC
2010
IEEE
14 years 7 months ago
Deep web data extraction
Deep Web contents are accessed by queries submitted to Web databases and the returned data records are enwrapped in dynamically generated Web pages (they will be called deep Web...
Jer Lang Hong
WWW
2008
ACM
15 years 10 months ago
iRobot: an intelligent crawler for web forums
We study in this paper the Web forum crawling problem, which is a very fundamental step in many Web applications, such as search engines and Web data mining. As a typical user-crea...
Rui Cai, Jiang-Ming Yang, Wei Lai, Yida Wang, Lei ...
ICDM
2002
IEEE
15 years 2 months ago
Recognition of Common Areas in a Web Page Using Visual Information: a possible application in a page classification
Extracting and processing information from web pages is an important task in many areas like constructing search engines, information retrieval, and data mining from the Web. Comm...
Milos Kovacevic, Michelangelo Diligenti, Marco Gor...
JDWM
2010
14 years 7 months ago
Mining Frequent Generalized Patterns for Web Personalization in the Presence of Taxonomies
The Web is a continuously evolving environment, since its content is updated on a regular basis. As a result, the traditional usage-based approach to generate recommendations that ...
Panagiotis Giannikopoulos, Iraklis Varlamis, Magda...