Sciweavers

385 search results - page 16 / 77
» Improving data mining utility with projective sampling
Sort
View
69
Voted
SIGSOFT
2010
ACM
14 years 7 months ago
LINKSTER: enabling efficient manual inspection and annotation of mined data
While many uses of mined software engineering data are automatic in nature, some techniques and studies either require, or can be improved, by manual methods. Unfortunately, manua...
Christian Bird, Adrian Bachmann, Foyzur Rahman, Ab...
ESEM
2007
ACM
14 years 11 months ago
Mining Software Evolution to Predict Refactoring
Can we predict locations of future refactoring based on the development history? In an empirical study of open source projects we found that attributes of software evolution data ...
Jacek Ratzinger, Thomas Sigmund, Peter Vorburger, ...
WWW
2008
ACM
15 years 10 months ago
Can chinese web pages be classified with english data source?
As the World Wide Web in China grows rapidly, mining knowledge in Chinese Web pages becomes more and more important. Mining Web information usually relies on the machine learning ...
Xiao Ling, Gui-Rong Xue, Wenyuan Dai, Yun Jiang, Q...
ICDM
2005
IEEE
166views Data Mining» more  ICDM 2005»
15 years 3 months ago
Sequential Pattern Mining in Multiple Streams
In this paper, we deal with mining sequential patterns in multiple data streams. Building on a state-of-the-art sequential pattern mining algorithm PrefixSpan for mining transact...
Gong Chen, Xindong Wu, Xingquan Zhu
103
Voted
ICDM
2003
IEEE
210views Data Mining» more  ICDM 2003»
15 years 2 months ago
CBC: Clustering Based Text Classification Requiring Minimal Labeled Data
Semi-supervised learning methods construct classifiers using both labeled and unlabeled training data samples. While unlabeled data samples can help to improve the accuracy of trai...
Hua-Jun Zeng, Xuanhui Wang, Zheng Chen, Hongjun Lu...