Sciweavers

29 search results - page 1 / 6
» Experiments on Processing Overlapping Parallel Corpora
Sort
View
63
Voted
LREC
2008
115views Education» more  LREC 2008»
14 years 11 months ago
Experiments on Processing Overlapping Parallel Corpora
The number and sizes of parallel corpora keep growing, which makes it necessary to have automatic methods of processing them: combining, checking and improving corpora quality, et...
Mark Fishel, Heiki Jaan Kaalep
IR
2008
14 years 9 months ago
Focused web crawling in the acquisition of comparable corpora
CLIR resources, such as dictionaries and parallel corpora, are scarce for special domains. Obtaining comparable corpora automatically for such domains could be an answer to this p...
Tuomas Talvensaari, Ari Pirkola, Kalervo Järv...
ICCPOL
2009
Springer
15 years 2 months ago
Constructing Parallel Corpus from Movie Subtitles
Abstract. This paper describes a methodology for constructing aligned German-Chinese corpora from movie subtitles. The corpora will be used to train a special machine translation s...
Han Xiao, Xiaojie Wang
89
Voted
LTCONF
2007
Springer
15 years 3 months ago
Leveraging Parallel Corpora and Existing Wordnets for Automatic Construction of the Slovene Wordnet
The paper reports on a series of experiments conducted in order to test the feasibility of automatically generating synsets for Slovene wordnet. The resources used were the multil...
Darja Fiser
TSD
2007
Springer
15 years 3 months ago
On the Relative Hardness of Clustering Corpora
Abstract. Clustering is often considered the most important unsupervised learning problem and several clustering algorithms have been proposed over the years. Many of these algorit...
David Pinto, Paolo Rosso