Sciweavers

» Corpora and data preparation
ACL
2004
14 years 11 months ago
Statistical Machine Translation with Word- and Sentence-Aligned Parallel Corpora
The parameters of statistical translation models are typically estimated from sentence-aligned parallel corpora. We show that significant improvements in the alignment and transla...
Chris Callison-Burch, David Talbot, Miles Osborne
IEEEVAST
2010
14 years 4 months ago
Understanding text corpora with multiple facets
Text visualization is becoming an increasingly important research topic as the need to understand massive-scale textual information proves imperative for many people and...
Lei Shi, Furu Wei, Shixia Liu, Li Tan, Xiaoxiao Li...
KDD
2010
ACM
15 years 1 months ago
Evolutionary hierarchical Dirichlet processes for multiple correlated time-varying corpora
Mining cluster evolution from multiple correlated time-varying text corpora is important in exploratory text analytics. In this paper, we propose an approach called evolutionary h...
Jianwen Zhang, Yangqiu Song, Changshui Zhang, Shix...
ACL
2011
14 years 1 months ago
Joint Bilingual Sentiment Classification with Unlabeled Parallel Corpora
Most previous work on multilingual sentiment analysis has focused on methods to adapt sentiment resources from resource-rich languages to resource-poor languages. We present a nov...
Bin Lu, Chenhao Tan, Claire Cardie, Benjamin K. Ts...
HCI
2009
14 years 7 months ago
Sign Language Recognition: Working with Limited Corpora
The availability of video format sign language corpora is limited. This leads to a desire for techniques which do not rely on large, fully-labelled datasets. This paper covers various...
Helen Cooper, Richard Bowden