Sciweavers

139 search results - page 12 / 28
» An Empirical Comparison of Four Text Mining Methods
Sort
View
GFKL
2005
Springer
167views Data Mining» more  GFKL 2005»
15 years 5 months ago
Quantitative Text Typology: The Impact of Sentence Length
Abstract. This study focuses on the contribution of sentence length for a quantitative text typology. Therefore, 333 Slovenian texts are analyzed with regard to their sentence leng...
Emmerich Kelih, Peter Grzybek, Gordana Antic, Erns...
KDD
2004
ACM
103views Data Mining» more  KDD 2004»
16 years 3 days ago
An objective evaluation criterion for clustering
We propose and test an objective criterion for evaluation of clustering performance: How well does a clustering algorithm run on unlabeled data aid a classification algorithm? The...
Arindam Banerjee, John Langford
ACL
2009
14 years 9 months ago
Extracting Paraphrases of Technical Terms from Noisy Parallel Software Corpora
In this paper, we study the problem of extracting technical paraphrases from a parallel software corpus, namely, a collection of duplicate bug reports. Paraphrase acquisition is a...
Xiaoyin Wang, David Lo, Jing Jiang, Lu Zhang, Hong...
BMCBI
2006
146views more  BMCBI 2006»
14 years 11 months ago
Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE
Background: In class prediction problems using microarray data, gene selection is essential to improve the prediction accuracy and to identify potential marker genes for a disease...
Satoshi Niijima, Satoru Kuhara
MEDINFO
2007
15 years 1 months ago
Using Discourse Analysis to Improve Text Categorization in MEDLINE
PROBLEM: Automatic keyword assignment has been largely studied in medical informatics in the context of the MEDLINE database, both for helping search in MEDLINE and in order to pr...
Patrick Ruch, Antoine Geissbühler, Julien Gob...