This paper describes a text normalization system for deletion-based abbreviations in informal text. We propose using statistical classifiers to learn the probability of deleting ...
Text mining applies the same analytical functions of data mining to the domain of textual information, relying on sophisticated text analysis techniques that distill information from f...
While much of the data on the web is unstructured in nature, there is also a significant amount of embedded structured data, such as product information on e-commerce sites or sto...
A novel text input for public displays and palmtop computers is presented that separates the input from the display of the edited text. While the public display shows both the edi...
Computer-generated texts are still far from human-generated ones. Along with the limited use of vocabulary and syntactic structures they present, their lack of creativeness and abstraction i...