Sciweavers

14 search results - page 2 / 3
» A Phrase-Based Statistical Model for SMS Text Normalization
ICASSP
2011
IEEE
12 years 8 months ago
Toward text message normalization: Modeling abbreviation generation
This paper describes a text normalization system for deletion-based abbreviations in informal text. We propose using statistical classifiers to learn the probability of deleting ...
Deana Pennell, Yang Liu
IPM
2011
12 years 8 months ago
Improving semistatic compression via phrase-based modeling
In recent years, new semistatic word-based byte-oriented text compressors, such as Tagged Huffman and those based on Dense Codes, have shown that it is possible to perform fast d...
Nieves R. Brisaboa, Antonio Fariña, Gonzalo...
TREC
2004
13 years 6 months ago
Conceptual Language Models for Context-Aware Text Retrieval
While participating in the HARD track our first question was, what an IR-application should look like that takes into account preference meta-data from the user, without the need ...
Henning Rode, Djoerd Hiemstra
PRIS
2004
13 years 6 months ago
Effect of Feature Smoothing Methods in Text Classification Tasks
The number of features to be considered in a text classification system is given by the size of the vocabulary and this is normally in the range of the tens or hundreds o...
David Vilar, Hermann Ney, Alfons Juan, Enrique Vid...
KDD
2005
ACM
14 years 5 months ago
Email data cleaning
Addressed in this paper is the issue of 'email data cleaning' for text mining. Many text mining applications need to take emails as input. Email data is usually noisy and thus i...
Jie Tang, Hang Li, Yunbo Cao, ZhaoHui Tang