This paper presents a Japanese information retrieval method using the dependency relationship between words and semantic and statistical information about them. Our method gives a...
This paper presents a statistical model for discovering topical clusters of words in unstructured text. The model uses a hierarchical Bayesian structure and it is also able to iden...
In statistical machine translation, the currently best performing systems are based in some way on phrases or word groups. We describe the baseline phrase-based translation system...
Incorporating background knowledge into data mining algorithms is an important but challenging problem. Current approaches in semi-supervised learning require explicit knowledge p...
Samah Jamal Fodeh, William F. Punch, Pang-Ning Tan
We present some of the technology developed at StreamSage for indexing and retrieving audio/video data. A primary difficulty of this task is precise extraction of the passages rel...
Anthony Davis, Philip Rennert, Robert Rubinoff, Ti...