The implementation of word spotting is not an easy procedure and it gets even worse in the case of historical documents since it requires character recognition and indexing of the...
How can it be said that texts are "near" or "distant" from one another? Are different texts by a single author more similar than texts by different authors? To...
Use cases have become an important tool in software engineering. There has been much focus on the diagram notation but relatively little on use-case descriptions. As part of a welc...
This paper describes work on Named Entity Recognition (NER), in preparation for Relation Extraction (RE), on data from a historical archive organisation. As is often the case in t...
Author identification is a text categorization task with applications in intelligence, criminal law, computer forensics, etc. Usually, in such cases there is shortage of training t...