Internet content today is about 80% text-based. Whether static or dynamic, the information is encoded and presented as multilingual, unstructured natural language text pages. As ...
Pavlin Dobrev, Albena Strupchanska, Galia Angelova
Abstract. We present an approach to define template languages for generating syntactically correct code. In the first part of the paper, we define the syntax and semantics of a ...
In this paper we propose a domain-independent text segmentation method, which consists of three components. Latent Dirichlet allocation (LDA) is employed to compute words semantic ...
The paper offers a new type of approach to the semantic phenomenon of adverbial aspect shift within the framework of finite-state temporal semantics. The heart of the proposal is a...