Sciweavers

10 search results - page 1 / 2
» Author identification: Using text sampling to handle the cla...
Sort
View
IPM
2008
196views more  IPM 2008»
13 years 4 months ago
Author identification: Using text sampling to handle the class imbalance problem
Authorship analysis of electronic texts assists digital forensics and anti-terror investigation. Author identification can be seen as a single-label multi-class text categorizatio...
Efstathios Stamatatos
ECAI
2006
Springer
13 years 8 months ago
Text Sampling and Re-Sampling for Imbalanced Authorship Identification Cases
Authorship identification can be seen as a single-label multi-class text categorization problem. Very often, there are extremely few training texts at least for some of the candida...
Efstathios Stamatatos
DEXAW
2007
IEEE
118views Database» more  DEXAW 2007»
13 years 11 months ago
Author Identification Using Imbalanced and Limited Training Texts
This paper deals with the problem of author identification. The Common N-Grams (CNG) method [6] is a language-independent profile-based approach with good results in many author i...
Efstathios Stamatatos
ECAI
2008
Springer
13 years 6 months ago
Author Identification Using a Tensor Space Representation
Author identification is a text categorization task with applications in intelligence, criminal law, computer forensics, etc. Usually, in such cases there is shortage of training t...
Spyridon Plakias, Efstathios Stamatatos
ISDA
2010
IEEE
13 years 2 months ago
Comparing SVM ensembles for imbalanced datasets
Real life datasets often suffer from the problem of class imbalance, which thwarts supervised learning process. In such data sets examples of positive (minority) class are signific...
Vasudha Bhatnagar, Manju Bhardwaj, Ashish Mahabal