Sciweavers

» New Algorithms for Text Fingerprinting
MICAI
2010
Springer
14 years 7 months ago
Towards Document Plagiarism Detection Based on the Relevance and Fragmentation of the Reused Text
Traditionally, External Plagiarism Detection has been carried out by determining and measuring the similar sections between a given pair of documents, known as source and suspiciou...
Fernando Sánchez-Vega, Luis Villaseñ...
KDD
2002
ACM
15 years 10 months ago
Integrating feature and instance selection for text classification
Instance selection and feature selection are two orthogonal methods for reducing the amount and complexity of data. Feature selection aims at the reduction of redundant features i...
Dimitris Fragoudis, Dimitris Meretakis, Spiros Lik...
ACL
2003
14 years 11 months ago
Fast Methods for Kernel-Based Text Analysis
Kernel-based learning (e.g., Support Vector Machines) has been successfully applied to many hard problems in Natural Language Processing (NLP). In NLP, although feature combinatio...
Taku Kudo, Yuji Matsumoto
ESWA
2006
14 years 9 months ago
An effective refinement strategy for KNN text classifier
Due to the exponential growth of documents on the Internet and the emergent need to organize them, the automated categorization of documents into predefined labels has received an...
Songbo Tan
SAC
2004
ACM
15 years 3 months ago
An optimized approach for KNN text categorization using P-trees
The importance of text mining stems from the availability of huge volumes of text databases holding a wealth of valuable information that needs to be mined. Text categorization is...
Imad Rahal, William Perrizo