Sciweavers

319 search results - page 23 / 64
» Distributional Features for Text Categorization
PRIS
2004
15 years 1 month ago
Effect of Feature Smoothing Methods in Text Classification Tasks
Abstract. The number of features to be considered in a text classification system is given by the size of the vocabulary and this is normally in the range of the tens or hundreds o...
David Vilar, Hermann Ney, Alfons Juan, Enrique Vid...
EMNLP
2004
15 years 1 month ago
Unsupervised Domain Relevance Estimation for Word Sense Disambiguation
This paper presents Domain Relevance Estimation (DRE), a fully unsupervised text categorization technique based on the statistical estimation of the relevance of a text with respe...
Alfio Massimiliano Gliozzo, Bernardo Magnini, Carl...
ICB
2007
Springer
15 years 3 months ago
Online Text-Independent Writer Identification Based on Stroke's Probability Distribution Function
Abstract. This paper introduces a novel method for online writer identification. Traditional methods make use of the distribution of directions in handwritten traces. The novelty o...
Bangyu Li, Zhenan Sun, Tieniu Tan
EMNLP
2008
15 years 1 month ago
Soft-Supervised Learning for Text Classification
We propose a new graph-based semisupervised learning (SSL) algorithm and demonstrate its application to document categorization. Each document is represented by a vertex within a ...
Amarnag Subramanya, Jeff Bilmes
KDD
2006
ACM
16 years 4 days ago
Extracting key-substring-group features for text classification
In many text classification applications, it is appealing to take every document as a string of characters rather than a bag of words. Previous research studies in this area mostl...
Dell Zhang, Wee Sun Lee