Sciweavers

98
Voted
EMNLP
2010
14 years 11 months ago
Discriminative Sample Selection for Statistical Machine Translation
Production of parallel training corpora for the development of statistical machine translation (SMT) systems for resource-poor languages usually requires extensive manual effort. ...
Sankaranarayanan Ananthakrishnan, Rohit Prasad, Da...
EOR
2007
85views more  EOR 2007»
15 years 29 days ago
Reject inference, augmentation, and sample selection
Many researchers see the need for reject inference in credit scoring models to come from a sample selection problem whereby a missing variable results in omitted variable bias. Al...
John Banasik, Jonathan Crook
87
Voted
ACL
1996
15 years 2 months ago
Minimizing Manual Annotation Cost in Supervised Training from Corpora
Corpus-based methods for natural language processing often use supervised training, requiring expensive manual annotation of training corpora. This paper investigates methods for ...
Sean P. Engelson, Ido Dagan
SDM
2008
SIAM
122views Data Mining» more  SDM 2008»
15 years 2 months ago
Type-Independent Correction of Sample Selection Bias via Structural Discovery and Re-balancing
Sample selection bias is a common problem in many real world applications, where training data are obtained under realistic constraints that make them follow a different distribut...
Jiangtao Ren, Xiaoxiao Shi, Wei Fan, Philip S. Yu