This work focuses on clustering a site into groups of documents that are predictive of future user accesses. Two approaches have been developed and tested. The first approach use...
We present a novel approach for multilingual document clustering using only comparable corpora to achieve cross-lingual semantic interoperability. The method models document colle...
In this paper, we explored how to use metadata in an information retrieval task. We presented a new language model that is able to take advantage of the category informa...
Rong Jin, Luo Si, Alexander G. Hauptmann, James P....
Abstract. In this paper we propose the clustering of top-ranking sentences (TRS) for effective information access. Top-ranking sentences are selected by a query-biased sentence ex...