Sciweavers

PAKDD
2004
ACM
Data Mining
Constraint-Based Graph Clustering through Node Sequencing and Partitioning
This paper proposes a two-step graph partitioning method to discover constrained clusters with an objective function that follows the well-known minmax clustering principle. Compar...
Yu Qian, Kang Zhang, Wei Lai
Spectral Energy Minimization for Semi-supervised Learning
The use of unlabeled data to aid classification is important as labeled data is often available in limited quantity. Instead of utilizing training samples directly into semi-super...
Chun Hung Li, Zhi-Li Wu
A Tree-Based Approach to the Discovery of Diagnostic Biomarkers for Ovarian Cancer
Computational diagnosis of cancer is a classification problem, and it has two special requirements on a learning algorithm: perfect accuracy and small number of features used in t...
Jinyan Li, Kotagiri Ramamohanarao
Providing Diversity in K-Nearest Neighbor Query Results
Given a point query Q in multi-dimensional space, K-Nearest Neighbor (KNN) queries return the K closest answers in the database with respect to Q. In this scenario, it is...
Anoop Jain, Parag Sarda, Jayant R. Haritsa
Mining of Web-Page Visiting Patterns with Continuous-Time Markov Models
This paper presents a new prediction model for predicting when an online customer leaves a current page and which next Web page the customer will visit. The model can forecast the ...
Qiming Huang, Qiang Yang, Joshua Zhexue Huang, Mic...
Extracting and Explaining Biological Knowledge in Microarray Data
Paul J. Kennedy, Simeon J. Simoff, David B. Skilli...
Clustering Multi-represented Objects with Noise
Traditional clustering algorithms are based on one representation space, usually a vector space. However, in a variety of modern applications, multiple representations ex...
Karin Kailing, Hans-Peter Kriegel, Alexey Pryakhin...
Towards Optimizing Conjunctive Inductive Queries
Inductive queries are queries to an inductive database that generate a set of patterns in a data mining context. Inductive querying poses new challenges to database and data mining...
Johannes Fischer, Luc De Raedt
Temporal Sequence Associations for Rare Events
In many real world applications, systematic analysis of rare events, such as credit card frauds and adverse drug reactions, is very important. Their low occurrence rate in large da...
Jie Chen, Hongxing He, Graham J. Williams, Huidong...
CMTreeMiner: Mining Both Closed and Maximal Frequent Subtrees
Tree structures are used extensively in domains such as computational biology, pattern recognition, XML databases, computer networks, and so on. One important problem in ...
Yun Chi, Yirong Yang, Yi Xia, Richard R. Muntz