Sciweavers

30 search results - page 2 / 6
» Learning the Countability of English Nouns from Corpus Data
LREC
2010
13 years 6 months ago
Using Comparable Corpora to Adapt a Translation Model to Domains
Statistical machine translation (SMT) requires a large parallel corpus, which is available only for restricted language pairs and domains. To expand the language pairs and domains...
Hiroyuki Kaji, Takashi Tsunakawa, Daisuke Okada
COOPIS
2003
IEEE
13 years 10 months ago
Mining for Lexons: Applying Unsupervised Learning Methods to Create Ontology Bases
Ontologies in current computer science parlance are computer based resources that represent agreed domain semantics. This paper first introduces ontologies in general and subseque...
Marie-Laure Reinberger, Peter Spyns, Walter Daelem...
ACL
2003
13 years 6 months ago
Exploiting Parallel Texts for Word Sense Disambiguation: An Empirical Study
A central problem of word sense disambiguation (WSD) is the lack of manually sense-tagged data required for supervised learning. In this paper, we evaluate an approach to automati...
Hwee Tou Ng, Bin Wang, Yee Seng Chan
AIME
2003
Springer
13 years 10 months ago
Learning Derived Words from Medical Corpora
Morphological knowledge (inflection, derivation, compounds) is useful for medical language processing. Some is available for medical English in the UMLS Specialist Lexic...
Pierre Zweigenbaum, Natalia Grabar
LREC
2010
13 years 6 months ago
Automatic Acquisition of Chinese Novel Noun Compounds
Automatic acquisition of novel compounds is notoriously difficult because most novel compounds have relatively low frequency in a corpus. The current study proposes a new method t...
Meng Wang, Chu-Ren Huang, Shiwen Yu, Weiwei Sun