Sciweavers

EMNLP
2010
13 years 2 months ago
Discriminative Sample Selection for Statistical Machine Translation
Production of parallel training corpora for the development of statistical machine translation (SMT) systems for resource-poor languages usually requires extensive manual effort. ...
Sankaranarayanan Ananthakrishnan, Rohit Prasad, Da...
EOR
2007
13 years 4 months ago
Reject inference, augmentation, and sample selection
Many researchers see the need for reject inference in credit scoring models to come from a sample selection problem whereby a missing variable results in omitted variable bias. Al...
John Banasik, Jonathan Crook
ACL
1996
13 years 5 months ago
Minimizing Manual Annotation Cost in Supervised Training from Corpora
Corpus-based methods for natural language processing often use supervised training, requiring expensive manual annotation of training corpora. This paper investigates methods for ...
Sean P. Engelson, Ido Dagan
SDM
2008
SIAM
13 years 6 months ago
Type-Independent Correction of Sample Selection Bias via Structural Discovery and Re-balancing
Sample selection bias is a common problem in many real-world applications, where training data are obtained under realistic constraints that make them follow a different distribut...
Jiangtao Ren, Xiaoxiao Shi, Wei Fan, Philip S. Yu