Sciweavers

ICASSP
2011
IEEE
12 years 8 months ago
Unsupervised determination of efficient Korean LVCSR units using a Bayesian Dirichlet process model
Korean is an agglutinative language that does not have explicit word boundaries. It is also a highly inflective language that exhibits severe coarticulation effects. These charac...
Sakriani Sakti, Andrew M. Finch, Ryosuke Isotani, ...
COLING
2010
12 years 11 months ago
Semantic Classification of Automatically Acquired Nouns using Lexico-Syntactic Clues
In this paper, we present a two-stage approach to acquire Japanese unknown morphemes from text with full POS tags assigned to them. We first acquire unknown morphemes only making ...
Yugo Murawaki, Sadao Kurohashi
ACTAC
2006
13 years 4 months ago
Named Entity Recognition for Hungarian Using Various Machine Learning Algorithms
In this paper we introduce a statistical Named Entity recognizer (NER) system for the Hungarian language. We examined three methods for identifying and disambiguating proper nouns...
Richárd Farkas, György Szarvas, Andr...
ACL
1998
13 years 6 months ago
Locating Noun Phrases with Finite State Transducers
We present a method for constructing, maintaining and consulting a database of proper nouns. We describe noun phrases composed of a proper noun and/or a description of a human occ...
Jean Senellart
ACL
2003
13 years 6 months ago
Semantic Classification of Chinese Unknown Words
This paper describes a classifier that assigns semantic thesaurus categories to unknown Chinese words (words not already in the CiLin thesaurus and the Chinese Electronic Dictiona...
Huihsin Tseng
LREC
2008
13 years 6 months ago
Towards Spanish Verbs' Selectional Preferences Automatic Acquisition: Semantic Annotation of the SenSem Corpus
We present the results of an agreement task carried out in the framework of the KNOW Project and consisting in manually annotating an agreement sample totaling 50 sentences extrac...
Jordi Carrera, Irene Castellón, Salvador Cl...
EMNLP
2007
13 years 6 months ago
Learning to Find English to Chinese Transliterations on the Web
We present a method for learning to find English to Chinese transliterations on the Web. In our approach, proper nouns are expanded into new queries aimed at maximizing the probab...
Jian-Cheng Wu, Jason S. Chang
NLPRS
2001
Springer
13 years 9 months ago
Cross-Language Information Retrieval of Proper Nouns using Context Information
Translating news articles frequently involves finding foreign language equivalents for proper nouns occurring for the first time in an original article, a time-consuming and labor...
Isao Goto, Noriyoshi Uratani, Terumasa Ehara
CLEANDB
2006
ACM
13 years 10 months ago
Cleansing Databases of Misspelled Proper Nouns
The paper presents a data cleansing technique for string databases. We propose and evaluate an algorithm that identifies a group of strings that consists of (multiple) occurrence...
Arturas Mazeika, Michael H. Böhlen