Long Sentence Partitioning using Structure Analysis for Machine Translation

13 years 9 months ago

Download www.afnlp.org

in machine translation, long sentences are usually assumed to be difficult to treat. The main reason is the syntactic ambiguity which increases explosively as a sentence become longer. Especially, in the machine translation using sentence patterns, a long sentence causes a critical coverage problem. In this paper, we present a method of sentence partitioning which recognizes sub-sentence ranges by structure analysis, reducing the length of a sentence for translation. For the analysis of the clausal structure, phrase-level sentence patterns which have only a little syntactic ambiguities are employed. The structure analysis is conducted by the recognition of starting points of all clauses, dependency analysis, and depth analysis. Then, the ranges of sub-sentences are extracted based on the depth by stages. Our method was evaluated on 108 sentences extracted from CNN transcripts. It showed 85.2% accuracy in the detection of simple sentences.

Yoon-Hyung Roh, Young Ae Seo, Ki-Young Lee, Sung-K

Real-time Traffic

Long Sentence | Machine Translation | Natural Language Processing | NLPRS 2001 | Sentence Patterns |

claim paper

» Symmetric Pattern Matching Analysis for English Coordinate Structures

» PASBio predicateargument structures for event extraction in molecular biology

» A Criticality Analysis of Clustering in Superscalar Processors

» Conotoxin Protein Classification Using Free Scores of Words and Support Vector Machines

Post Info
More Details (n/a)

Added	30 Jul 2010
Updated	30 Jul 2010
Type	Conference
Year	2001
Where	NLPRS
Authors	Yoon-Hyung Roh, Young Ae Seo, Ki-Young Lee, Sung-Kwon Choi

Comments (0)

Sciweavers

Long Sentence Partitioning using Structure Analysis for Machine Translation

Long Sentence | Machine Translation | Natural Language Processing | NLPRS 2001 | Sentence Patterns |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers