Sciweavers

237 search results - page 18 / 48
» acl 2008
Sort
View
ACL
2008
15 years 1 months ago
Language Dynamics and Capitalization using Maximum Entropy
This paper studies the impact of written language variations and the way it affects the capitalization task over time. A discriminative approach, based on maximum entropy models, ...
Fernando Batista, Nuno J. Mamede, Isabel Trancoso
ACL
2008
15 years 1 months ago
Icelandic Data Driven Part of Speech Tagging
Data driven POS tagging has achieved good performance for English, but can still lag behind linguistic rule based taggers for morphologically complex languages, such as Icelandic....
Mark Dredze, Joel Wallenberg
ACL
2008
15 years 1 months ago
Decompounding query keywords from compounding languages
Splitting compound words has proved to be useful in areas such as Machine Translation, Speech Recognition or Information Retrieval (IR). Furthermore, real-time IR systems (such as...
Enrique Alfonseca, Slaven Bilac, Stefan Pharies
ACL
2008
15 years 1 months ago
Randomized Language Models via Perfect Hash Functions
We propose a succinct randomized language model which employs a perfect hash function to encode fingerprints of n-grams and their associated probabilities, backoff weights, or oth...
David Talbot, Thorsten Brants
ACL
2008
15 years 1 months ago
Phrase Table Training for Precision and Recall: What Makes a Good Phrase and a Good Phrase Pair?
In this work, the problem of extracting phrase translation is formulated as an information retrieval process implemented with a log-linear model aiming for a balanced precision an...
Yonggang Deng, Jia Xu, Yuqing Gao