Free text botanical descriptions contained in printed floras can provide a wealth of valuable scientific information. In spite of this richness, these texts have seldom been anal...
Modeling of a business system has traditionally been based on free text documents. This work describes an elaborate experiment that constitutes a proof of concept to the idea that ...
Thesauri and ontologies provide important value in facilitating access to digital archives by representing underlying principles of organization. Translation of such resources int...
G. Craig Murray, Bonnie J. Dorr, Jimmy J. Lin, Jan...
Parallel corpora are a valuable resource for tasks such as cross-language information retrieval and data-driven natural language processing systems. Previously only small scale cor...
There has been relatively little work focused on determining the formality level of individual lexical items. This study applies information from large mixedgenre corpora, demonst...