Much information over the Internet is expressed by natural languages. The management of linguistic information involves an operation of comparison and aggregation. In this paper, ...
In response to the advance of ubiquitous computing technologies, we believe that for computer systems to be ubiquitous, they must be context-aware. In this paper, we address the i...
Arthur H. van Bunningen, Ling Feng, Peter M. G. Ap...
The indexation of documents is a critical step of the information retrieval process and is often a manual task which highly depends on the indexer’s knowledge. We propose to imp...
We analyze the persistence of information on the web, looking at the percentage of invalid URLs contained in academic articles within the CiteSeer (ResearchIndex) database. The nu...
Steve Lawrence, Frans Coetzee, Gary William Flake,...
We employ Automorphology, an MDL-based algorithm that determines the suffixes present in a language-sample with no prior knowledge of the language in question, and describe our exp...
John A. Goldsmith, Derrick Higgins, Svetlana Sogla...