Sciweavers

319 search results - page 17 / 64
» Distributional Features for Text Categorization
Sort
View
ESWA
2006
149views more  ESWA 2006»
14 years 11 months ago
An effective refinement strategy for KNN text classifier
Due to the exponential growth of documents on the Internet and the emergent need to organize them, the automated categorization of documents into predefined labels has received an...
Songbo Tan
AIMSA
2006
Springer
15 years 3 months ago
N-Gram Feature Selection for Authorship Identification
Automatic authorship identification offers a valuable tool for supporting crime investigation and security. It can be seen as a multi-class, single-label text categorization task. ...
John Houvardas, Efstathios Stamatatos
DMIN
2008
134views Data Mining» more  DMIN 2008»
15 years 1 months ago
Political Leaning Categorization by Exploring Subjectivities in Political Blogs
This paper addresses a relatively new text categorization problem: classifying a political blog as either `liberal' or `conservative', based on its political leaning. Ins...
Maojin Jiang, Shlomo Argamon
ICPR
2008
IEEE
16 years 27 days ago
A novel Gaussianized vector representation for natural scene categorization
This paper presents a novel Gaussianized vector representation for scene images by an unsupervised approach. First, each image is encoded as an ensemble of orderless bag of featur...
Hao Tang, Mark Hasegawa-Johnson, Thomas S. Huang, ...
IPM
2008
196views more  IPM 2008»
14 years 11 months ago
Author identification: Using text sampling to handle the class imbalance problem
Authorship analysis of electronic texts assists digital forensics and anti-terror investigation. Author identification can be seen as a single-label multi-class text categorizatio...
Efstathios Stamatatos