Sciweavers

129 search results - page 12 / 26
» A Corpus of Scope-disambiguated English Text
LREC
2010
14 years 11 months ago
From Speech to Trees: Applying Treebank Annotation to Arabic Broadcast News
The Arabic Treebank (ATB) Project at the Linguistic Data Consortium (LDC) has embarked on a large corpus of Broadcast News (BN) transcriptions, and this has led to a number of new...
Mohamed Maamouri, Ann Bies, Seth Kulick, Wajdi Zag...
ACL
2006
14 years 11 months ago
A Phrase-Based Statistical Model for SMS Text Normalization
Short Messaging Service (SMS) texts behave quite differently from normal written texts and have some very special phenomena. To translate SMS texts, traditional approaches model s...
AiTi Aw, Min Zhang, Juan Xiao, Jian Su
ACL
2006
14 years 11 months ago
A FrameNet-Based Semantic Role Labeler for Swedish
We present a FrameNet-based semantic role labeling system for Swedish text. As training data for the system, we used an annotated corpus that we produced by transferring FrameNet ...
Richard Johansson, Pierre Nugues
DCC
2001
IEEE
15 years 9 months ago
LIPT: A Reversible Lossless Text Transform to Improve Compression Performance
Lossless compression researchers have developed highly sophisticated approaches, such as Huffman encoding, arithmetic encoding, the Lempel-Ziv family, Dynamic Markov Compression (D...
Fauzia S. Awan, Nan Zhang 0005, Nitin Motgi, Raja ...
LREC
2008
14 years 11 months ago
Linguistic Resources for Reconstructing Spontaneous Speech Text
The output of a speech recognition system is not always ideal for subsequent downstream processing, in part because speakers themselves often make mistakes. A system would accompl...
Erin Fitzgerald, Frederick Jelinek