Sciweavers

555 search results - page 40 / 111
» An Empirical Study on Web Mining of Parallel Data
Sort
View
VLDB
1999
ACM
188views Database» more  VLDB 1999»
15 years 1 months ago
SPIRIT: Sequential Pattern Mining with Regular Expression Constraints
Discovering sequential patterns is an important problem in data mining with a host of application domains including medicine, telecommunications, and the World Wide Web. Conventio...
Minos N. Garofalakis, Rajeev Rastogi, Kyuseok Shim
PAKDD
2010
ACM
167views Data Mining» more  PAKDD 2010»
15 years 2 months ago
Hierarchical Web-Page Clustering via In-Page and Cross-Page Link Structures
Abstract. Despite of the wide diversity of web-pages, web-pages residing in a particular organization, in most cases, are organized with semantically hierarchic structures. For exa...
Cindy Xide Lin, Yintao Yu, Jiawei Han, Bing Liu
DEXAW
2007
IEEE
92views Database» more  DEXAW 2007»
15 years 4 months ago
Subtree Testing and Closed Tree Mining Through Natural Representations
Several classical schemes exist to represent trees as strings over a fixed alphabet; these are useful in many algorithmic and conceptual studies. Our previous work has proposed a...
José L. Balcázar, Albert Bifet, Anto...
KDD
2009
ACM
248views Data Mining» more  KDD 2009»
15 years 2 months ago
PSkip: estimating relevance ranking quality from web search clickthrough data
1 In this article, we report our efforts in mining the information encoded as clickthrough data in the server logs to evaluate and monitor the relevance ranking quality of a commer...
Kuansan Wang, Toby Walker, Zijian Zheng
FLAIRS
2006
14 years 11 months ago
Using Web Searches on Important Words to Create Background Sets for LSI Classification
The world wide web has a wealth of information that is related to almost any text classification task. This paper presents a method for mining the web to improve text classificati...
Sarah Zelikovitz, Marina Kogan