Sciweavers

41 search results - page 3 / 9
» Large Scale Parallel Document Mining for Machine Translation
Sort
View
IJCNLP
2005
Springer
13 years 10 months ago
Inversion Transduction Grammar Constraints for Mining Parallel Sentences from Quasi-Comparable Corpora
Abstract. We present a new implication of Wu’s (1997) Inversion Transduction Grammar (ITG) Hypothesis, on the problem of retrieving truly parallel sentence translations from larg...
Dekai Wu, Pascale Fung
EMNLP
2008
13 years 6 months ago
Language and Translation Model Adaptation using Comparable Corpora
Traditionally, statistical machine translation systems have relied on parallel bi-lingual data to train a translation model. While bi-lingual parallel data are expensive to genera...
Matthew G. Snover, Bonnie J. Dorr, Richard M. Schw...
LREC
2010
119views Education» more  LREC 2010»
13 years 6 months ago
Utilizing Semantic Equivalence Classes of Japanese Functional Expressions in Translation Rule Acquisition from Parallel Patent S
In the "Sandglass" MT architecture, we identify the class of monosemous Japanese functional expressions and utilize it in the task of translating Japanese functional exp...
Taiji Nagasaka, Ran Shimanouchi, Akiko Sakamoto, T...
MLDM
2009
Springer
13 years 12 months ago
PMCRI: A Parallel Modular Classification Rule Induction Framework
In a world where massive amounts of data are recorded on a large scale we need data mining technologies to gain knowledge from the data in a reasonable time. The Top Down Induction...
Frederic T. Stahl, Max A. Bramer, Mo Adda
KDD
2007
ACM
186views Data Mining» more  KDD 2007»
14 years 5 months ago
Content-based document routing and index partitioning for scalable similarity-based searches in a large corpus
We present a document routing and index partitioning scheme for scalable similarity-based search of documents in a large corpus. We consider the case when similarity-based search ...
Deepavali Bhagwat, Kave Eshghi, Pankaj Mehra