Abstract. Geographic named entities can be classified into many subtypes that are useful for applications such as information extraction and question answering. In this paper, we ...
In this paper we present an approach to tackle three important problems of text normalization: sentence boundary disambiguation, disambiguation of capitalized words when they are ...
Abstract. A new interpretation of rules in rough set theory is introduced. According to the positive, boundary, and negative regions of a set, one can make a three-way decision: ac...
This paper describes an unsupervised algorithm for segmenting categorical time series. The algorithm first collects statistics about the frequency and boundary entropy of ngrams, t...
Service-oriented systems facilitate business workflows to span multiple organizations (e.g. by means of Web services). As a side effect, data may be more easily transferred over o...