Sciweavers

423 search results - page 68 / 85
» Text Classification by Labeling Words
Sort
View
CLEF
2010
Springer
15 years 4 months ago
ZOT! to Wikipedia Vandalism - Lab Report for PAN at CLEF 2010
Abstract This vandalism detector uses features primarily derived from a wordpreserving differencing of the text for each Wikipedia article from before and after the edit, along wit...
James White, Rebecca Maessen
162
Voted
ICASSP
2011
IEEE
14 years 7 months ago
A hierarchical generative model for Generic Audio Document Categorization
In this paper, we call the pattern classification problem that consists in assigning a category label to a long audio signal based on its semantic content as Generic Audio Documen...
Zhi Zeng, Shuwu Zhang
100
Voted
ICDAR
2009
IEEE
15 years 10 months ago
Classifying Foreground Pixels in Document Images
We present a system that classifies pixels in a document image according to marking type such as machine print, handwriting, and noise. A segmenter module first splits an input ...
Prateek Sarkar, Eric Saund, Jing Lin
COMAD
2009
15 years 4 months ago
Business Insight from Collection of Unstructured Formatted Documents with IBM Content Harvester
In this paper, we report the development and experiments of IBM Content Harvester (CH), a tool to analyze and recover templates and content from word processor created text docume...
Biplav Srivastava, Yuan-Chi Chang
GECCO
2005
Springer
139views Optimization» more  GECCO 2005»
15 years 9 months ago
Use of a genetic algorithm in brill's transformation-based part-of-speech tagger
The tagging problem in natural language processing is to find a way to label every word in a text as a particular part of speech, e.g., proper noun. An effective way of solving th...
Garnett Carl Wilson, Malcolm I. Heywood