Sources of training data suitable for language modeling of conversational speech are limited. In this paper, we show how training data can be supplemented with text from the web ď...
This paper presents a new enhanced text extraction algorithm from degraded document images on the basis of the probabilistic models. The observed document image is considered as a...
Here is discussed how to construct domain ontologies with both taxonomic and non-taxonomic conceptual relationships, exploiting a machinereadable dictionary (MRD) and domain-speci...
The techniques of information retrieval and information extraction are complementary, but to date there has been little concrete work aimed at integrating the two. We describe how...
The Text2Dialogue (T2D) system that we are developing allows digital content creators to generate attractive multi-modal dialogues presented by two virtual agents—by simply provi...
Paul Piwek, Hugo Hernault, Helmut Prendinger, Mits...