We describe a method for automatically identifying word sense variation in a dated collection of historical books in a large digital library. By leveraging a small set of kno...
Many important application areas of text classifiers demand high precision, and it is common to compare prospective solutions against the performance of Naive Bayes. This baseline is us...
We have developed a middleware framework for workgroup environments that can support distributed software development and a variety of other application domains requiring document...
Click data captures many users’ document preferences for a query and has been shown to help significantly improve search engine ranking. However, most click data is noisy and of...
Federated text search provides a unified search interface for multiple search engines of distributed text information sources. Resource selection is an important component for fed...