To successfully prepare and model data, the data miner needs to be aware of the properties of the data manifold. In this chapter, the outline of a tool for automatically generating...
The problem of sequence categorization is to generalize from a corpus of labeled sequences procedures for accurately labeling future unlabeled sequences. The choice of representat...
A multilingual Internet-based employment advertisement system is described. Job ads are submitted as e-mailtexts, analysed by an example-based pattern matcher and stored in langua...
Harold L. Somers, Bill Black, Joakim Nivre, Torbj&...
PLANDoc, a system under joint development by Columbia and Bellcore, documents the activity of planning engineers as they study telephone routes. It takes as input a trace of the e...
This paper describes an attempt to model the method of generating a fact from the opinions of other persons or of institutions as a process which is based on knowledge about these...