Sciweavers

22121 search results - page 302 / 4425
» Modeling annotated data
Sort
View
NAACL
2001
15 years 18 days ago
Applying Co-Training Methods to Statistical Parsing
We propose a novel Co-Training method for statistical parsing. The algorithm takes as input a small corpus (9695 sentences) annotated with parse trees, a dictionary of possible le...
Anoop Sarkar
ACL
2010
14 years 9 months ago
Bucking the Trend: Large-Scale Cost-Focused Active Learning for Statistical Machine Translation
We explore how to improve machine translation systems by adding more translation data in situations where we already have substantial resources. The main challenge is how to buck ...
Michael Bloodgood, Chris Callison-Burch
LREC
2010
173views Education» more  LREC 2010»
15 years 21 days ago
Heterogeneous Data Sources for Signed Language Analysis and Synthesis: The SignCom Project
This paper describes how heterogeneous data sources captured in the SignCom project may be used for the analysis and synthesis of French Sign Language (LSF) utterances. The captur...
Kyle Duarte, Sylvie Gibet
OTM
2009
Springer
15 years 5 months ago
Contemporary Challenges in Ambient Data Integration for Biodiversity Informatics
Biodiversity informatics (BDI) information is both highly localized and highly distributed. The temporal and spatial contexts of data collection events are generally of primary imp...
David Thau, Robert A. Morris, Sean White
ANLP
1997
116views more  ANLP 1997»
15 years 18 days ago
A Maximum Entropy Approach to Identifying Sentence Boundaries
We present a trainable model for identifying sentence boundaries in raw text. Given a corpus annotated with sentence boundaries, our model learns to classify each occurrence of., ...
Jeffrey C. Reynar, Adwait Ratnaparkhi