Medical text mining has gained increasing interest in recent years. Radiology reports contain rich information describing radiologist’s observations on the patient’s medical c...
StreamMine is a scalable middleware for massive real-time data streaming. In this paper we present the BFSiena: a communication substrate for the StreamMine. BFSiena is a content-...
Weintroducecoactive learning as a distributed learning approachto data miningin networkedand distributed databases. Thecoactive learningalgorithmsact on independent data sets and ...
We propose a novel model to automatically extract transliteration pairs from parallel corpora. Our model is efficient, language pair independent and mines transliteration pairs i...
Multi-core processors are proliferated across different domains in recent years. In this paper, we study the performance of frequent pattern mining on a modern multi-core machine....