training data | Sciweavers

8

NAACL
2007

80views Computational Linguistics» more NAACL 2007»

Virtual Evidence for Training Speech Recognizers Using Partially Labeled Data

13 years 6 months ago

Collecting supervised training data for automatic speech recognition (ASR) systems is both time consuming and expensive. In this paper we use the notion of virtual evidence in a g...

Amarnag Subramanya, Jeff A. Bilmes

claim paper

Read More »

13

click to vote

NAACL
2007

106views Computational Linguistics» more NAACL 2007»

Can Semantic Roles Generalize Across Genres?

13 years 6 months ago

Download acl.ldc.upenn.edu

PropBank has been widely used as training data for Semantic Role Labeling. However, because this training data is taken from the WSJ, the resulting machine learning models tend to...

Szu-ting Yi, Edward Loper, Martha Palmer

claim paper

Read More »

6

click to vote

NAACL
2007

95views Computational Linguistics» more NAACL 2007»

Using "Annotator Rationales" to Improve Machine Learning for Text Categorization

13 years 6 months ago

Download cs.jhu.edu

We propose a new framework for supervised machine learning. Our goal is to learn from smaller amounts of supervised training data, by collecting a richer kind of training data: an...

Omar Zaidan, Jason Eisner, Christine D. Piatko

claim paper

Read More »

8

click to vote

MSV
2007

114views Modeling And Simulation» more MSV 2007»

Assessment of ARMAX Structure as a Global Model for Self-Refilling Steam Distillation Essential Oil Extraction System

13 years 6 months ago

Download www.asprg.net

Abstract - In this paper, an essential oil extraction system with self-refilling system is modeled based on inputoutput data collected from a dedicated acquisition system. The ARMA...

Mohd Hezri Fazalul Rahiman, Mohd Nasir Taib, Yusof...

claim paper

Read More »

11

click to vote

LREC
2008

114views Education» more LREC 2008»

Improving Statistical Machine Translation Efficiency by Triangulation

13 years 6 months ago

Download www.lrec-conf.org

In current phrase-based Statistical Machine Translation systems, more training data is generally better than less. However, a larger data set eventually introduces a larger model ...

Yu Chen, Andreas Eisele, Martin Kay

claim paper

Read More »

8

click to vote

LREC
2008

84views Education» more LREC 2008»

Statistical Identification of English Loanwords in Korean Using Automatically Generated Training Data

13 years 6 months ago

Download www.lrec-conf.org

This paper describes an accurate, extensible method for automatically classifying unknown foreign words that requires minimal monolingual resources and no bilingual training data ...

Kirk Baker, Chris Brew

claim paper

Read More »

10

click to vote

LREC
2008

110views Education» more LREC 2008»

Cost-Sensitive Learning in Answer Extraction

13 years 6 months ago

Download www.lrec-conf.org

One problem of data-driven answer extraction in open-domain factoid question answering is that the class distribution of labeled training data is fairly imbalanced. This imbalance...

Michael Wiegand, Jochen L. Leidner, Dietrich Klako...

claim paper

Read More »

13

click to vote

ICMLA
2007

104views Machine Learning» more ICMLA 2007»

Scalable optimal linear representation for face and object recognition

13 years 6 months ago

Download ww2.cs.fsu.edu

Optimal Component Analysis (OCA) is a linear method for feature extraction and dimension reduction. It has been widely used in many applications such as face and object recognitio...

Yiming Wu, Xiuwen Liu, Washington Mio

claim paper

Read More »

5

click to vote

ICMLA
2007

80views Machine Learning» more ICMLA 2007»

Memory-based context-sensitive spelling correction at web scale

13 years 6 months ago

Download www.cs.cmu.edu

We study the problem of correcting spelling mistakes in text using memory-based learning techniques and a very large database of token n-gram occurrences in web text as training d...

Andrew Carlson, Ian Fette

claim paper

Read More »

15

click to vote

EMNLP
2007

114views Natural Language Processing» more EMNLP 2007»

Bootstrapping Information Extraction from Field Books

13 years 6 months ago

Download ilk.uvt.nl

We present two machine learning approaches to information extraction from semi-structured documents that can be used if no annotated training data are available, but there does ex...

Sander Canisius, Caroline Sporleder

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers