Abstract. This paper proposes a two-step method for Chinese text categorization (TC). In the first step, a Naïve Bayesian classifier is used to fix the fuzzy area between two cate...
We have developed SummaryBIFF, a new e-mail delivery system for mobile phones that sends a summary of each newly arrived message and the URL connected to the HTML file converted f...
- Knowledge extraction methods have not efficiently evolved towards new methods to automate the process of building multilingual ontologies as the main representation of structured...
In this paper, we present an automated, quantitative, knowledge-poor method to evaluate the randomness of a collection of documents (corpus), with respect to a number of biased pa...
The Earley algorithm is a widely used parsing method in natural language processing applications. We introduce a variant of Earley parsing that is based on a “delayed” recognit...