Sciweavers

133 search results - page 7 / 27
» Joint Tokenization and Translation
Sort
View
LREC
2008
141views Education» more  LREC 2008»
15 years 1 months ago
New Resources for Document Classification, Analysis and Translation Technologies
The goal of the DARPA MADCAT (Multilingual Automatic Document Classification Analysis and Translation) Program is to automatically convert foreign language text images into Englis...
Stephanie Strassel, Lauren Friedman, Safa Ismael, ...
ACL
2011
14 years 3 months ago
A Large Scale Distributed Syntactic, Semantic and Lexical Language Model for Machine Translation
This paper presents an attempt at building a large scale distributed composite language model that simultaneously accounts for local word lexical information, mid-range sentence s...
Ming Tan, Wenli Zhou, Lei Zheng, Shaojun Wang
101
Voted
TASLP
2008
207views more  TASLP 2008»
14 years 11 months ago
Joint Morphological-Lexical Language Modeling for Processing Morphologically Rich Languages With Application to Dialectal Arabic
Language modeling for an inflected language such as Arabic poses new challenges for speech recognition and machine translation due to its rich morphology. Rich morphology results i...
Ruhi Sarikaya, Mohamed Afify, Yonggang Deng, Hakan...
RSP
2000
IEEE
156views Control Systems» more  RSP 2000»
15 years 4 months ago
Quasi-Static Scheduling of Reconfigurable Dataflow Graphs for DSP Systems
Dataflow programming has proven to be popular for representing applications in rapid prototyping tools for digital signal processing (DSP); however, existing dataflow design tools...
Bishnupriya Bhattacharya, Shuvra S. Bhattacharyya
78
Voted
ACL
2012
13 years 2 months ago
A Broad-Coverage Normalization System for Social Media Language
Social media language contains huge amount and wide variety of nonstandard tokens, created both intentionally and unintentionally by the users. It is of crucial importance to norm...
Fei Liu, Fuliang Weng, Xiao Jiang