Sciweavers

850 search results - page 5 / 170
» Representing Text Chunks
Sort
View
LREC
2008
132views Education» more  LREC 2008»
14 years 10 months ago
Babylon Parallel Text Builder: Gathering Parallel Texts for Low-Density Languages
This paper describes BABYLON, a system that attempts to overcome the shortage of parallel texts in low-density languages by supplementing existing parallel texts with texts gather...
Michael Mohler, Rada Mihalcea
LREC
2010
136views Education» more  LREC 2010»
14 years 10 months ago
Partial Parsing of Spontaneous Spoken French
This paper describes the process and the resources used to automatically annotate a French corpus of spontaneous speech transcriptions in super-chunks. Super-chunks are enhanced c...
Olivier Blanc, Matthieu Constant, Anne Dister, Pat...
NLE
2007
180views more  NLE 2007»
14 years 9 months ago
Segmentation and alignment of parallel text for statistical machine translation
We address the problem of extracting bilingual chunk pairs from parallel text to create training sets for statistical machine translation. We formulate the problem in terms of a s...
Yonggang Deng, Shankar Kumar, William Byrne
ECIR
2009
Springer
15 years 6 months ago
On Automatic Plagiarism Detection Based on n-Grams Comparison
Abstract. When automatic plagiarism detection is carried out considering a reference corpus, a suspicious text is compared to a set of original documents in order to relate the pla...
Alberto Barrón-Cedeño, Paolo Rosso