Sciweavers

COLING
2010
12 years 11 months ago
Acquisition of Unknown Word Paradigms for Large-Scale Grammars
Unknown words are a major issue for large-scale grammars of natural language. We propose a machine learning based algorithm for acquiring lexical entries for all forms in the para...
Kostadin Cholakov, Gertjan van Noord
COLING
2010
12 years 11 months ago
A Multi-Domain Web-Based Algorithm for POS Tagging of Unknown Words
We present a web-based algorithm for the task of POS tagging of unknown words (words appearing only a small number of times in the training data of a supervised POS tagger). When ...
Shulamit Umansky-Pesin, Roi Reichart, Ari Rappopor...
COLING
2002
13 years 4 months ago
Unknown Word Extraction for Chinese Documents
There is no blank to mark word boundaries in Chinese text. As a result, identifying words is difficult, because of segmentation ambiguities and occurrences of unknown words. Conve...
Keh-Jiann Chen, Wei-Yun Ma
JETAI
2007
13 years 4 months ago
Contextual vocabulary acquisition as computational philosophy and as philosophical computation
Contextual vocabulary acquisition (CVA) is the active, deliberate acquisition of a meaning for an unknown word in a text by reasoning from textual clues, prior knowledge, and hypo...
William J. Rapaport, Michael W. Kibby
NAACL
2003
13 years 5 months ago
Unsupervised methods for developing taxonomies by combining syntactic and statistical information
This paper describes an unsupervised algorithm for placing unknown words into a taxonomy and evaluates its accuracy on a large and varied sample of words. The algorithm works by ...
Dominic Widdows
ACL
2003
13 years 5 months ago
Semantic Classification of Chinese Unknown Words
This paper describes a classifier that assigns semantic thesaurus categories to unknown Chinese words (words not already in the CiLin thesaurus and the Chinese Electronic Dictiona...
Huihsin Tseng
LREC
2008
13 years 5 months ago
A Hybrid Morphology-Based POS Tagger for Persian
In many applications of natural language processing (NLP) grammatically tagged corpora are needed. Thus Part of Speech (POS) Tagging is of high importance in the domain of NLP. Ma...
Mehrnoush Shamsfard, Hakimeh Fadaei
LREC
2010
13 years 5 months ago
AutoTagTCG : A Framework for Automatic Thai CG Tagging
Recently, categorical grammar has been focused as a powerful grammar. This paper aims to develop a framework for automatic CG tagging for Thai. We investigated two main algorithms...
Thepchai Supnithi, Taneth Ruangrajitpakorn, Kanoko...
IJCNLP
2005
Springer
13 years 10 months ago
A Chunking Strategy Towards Unknown Word Detection in Chinese Word Segmentation
This paper proposes a chunking strategy to detect unknown words in Chinese word segmentation. First, a raw sentence is pre-segmented into a sequence of word atoms using a maximum...
Guodong Zhou