Abstract. In this paper, the authors address the permutation ambiguity that exists in frequency domain Independent Component Analysis of convolutive mixtures. Many methods have bee...
Abstract. Finding near-duplicate images is a task often found in Multimedia Information Retrieval (MIR). Toward this effort, we propose a novel idea by bridging two seemingly unrel...
Hung-sik Kim, Hau-Wen Chang, Jeongkyu Lee, Dongwon...
The importance of sentence-aligned parallel corpora has been widely acknowledged. Reference corpora in which sub-sentential translational correspondences are indicated manually ar...
In this paper, we will present an efficient method to compute the co-occurrence counts of any pair of substring in a parallel corpus, and an algorithm that make use of these count...
We introduce a semi-supervised approach to training for statistical machine translation that alternates the traditional Expectation Maximization step that is applied on a large tr...