Sciweavers

ACL
2006
13 years 7 months ago
Maximum Entropy Based Restoration of Arabic Diacritics
Short vowels and other diacritics are not part of written Arabic scripts. Exceptions are made for important political and religious texts and in scripts for beginning students of ...
Imed Zitouni, Jeffrey S. Sorensen, Ruhi Sarikaya
ACL
2006
13 years 7 months ago
The Effect of Translation Quality in MT-Based Cross-Language Information Retrieval
This paper explores the relationship between the translation quality and the retrieval effectiveness in Machine Translation (MT) based Cross-Language Information Retrieval (CLIR)....
Jiang Zhu, Haifeng Wang
ACL
2006
13 years 7 months ago
Modeling Commonality among Related Classes in Relation Extraction
This paper proposes a novel hierarchical learning strategy to deal with the data sparseness problem in relation extraction by modeling the commonality among related classes. For e...
Guodong Zhou, Jian Su, Min Zhang
ACL
2006
13 years 7 months ago
BiTAM: Bilingual Topic AdMixture Models for Word Alignment
We propose a novel bilingual topical admixture (BiTAM) formalism for word alignment in statistical machine translation. Under this formalism, the parallel sentence-pairs within a ...
Bing Zhao, Eric P. Xing
ACL
2006
13 years 7 months ago
A Composite Kernel to Extract Relations between Entities with Both Flat and Structured Features
This paper proposes a novel composite kernel for relation extraction. The composite kernel consists of two individual kernels: an entity kernel that allows for entity-related feat...
Min Zhang, Jie Zhang, Jian Su, Guodong Zhou
ACL
2006
13 years 7 months ago
A Progressive Feature Selection Algorithm for Ultra Large Feature Spaces
Recent developments in statistical modeling of various linguistic phenomena have shown that additional features give consistent performance improvements. Quite often, improvements...
Qi Zhang, Fuliang Weng, Zhe Feng
ACL
2006
13 years 7 months ago
Subword-Based Tagging for Confidence-Dependent Chinese Word Segmentation
We proposed a subword-based tagging for Chinese word segmentation to improve the existing character-based tagging. The subword-based tagging was implemented using the maximum entr...
Ruiqiang Zhang, Gen-ichiro Kikui, Eiichiro Sumita
ACL
2006
13 years 7 months ago
Inducing Word Alignments with Bilexical Synchronous Trees
This paper compares different bilexical tree-based models for bilingual alignment. EM training for the new model benefits from the dynamic programming "hook trick". The ...
Hao Zhang, Daniel Gildea
ACL
2006
13 years 7 months ago
Automatic Learning of Textual Entailments with Cross-Pair Similarities
In this paper we define a novel similarity measure between examples of textual entailments and we use it as a kernel function in Support Vector Machines (SVMs). This allows us to ...
Fabio Massimo Zanzotto, Alessandro Moschitti
ACL
2006
13 years 7 months ago
Discursive Usage of Six Chinese Punctuation Marks
Both rhetorical structure and punctuation have been helpful in discourse processing. Based on a corpus annotation project, this paper reports the discursive usage of 6 Chinese pun...
Ming Yue