Sciweavers

NAACL
2010

Language identification of names with SVMs

Download www.aclweb.org

The task of identifying the language of text or utterances has a number of applications in natural language processing. Language identification has traditionally been approached with character-level language models. However, the language model approach crucially depends on the length of the text in question. In this paper, we consider the problem of language identification of names. We show that an approach based on SVMs with n-gram counts as features performs much better than language models. We also experiment with applying the method to pre-process transliteration data for the training of separate models.
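The approach named in the abstract, an SVM over character n-gram counts, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the function names (`ngram_counts`, `train_linear_svm`, `train_one_vs_rest`, `predict`), the boundary markers `^` and `$`, the n-gram lengths 1 to 3, the linear kernel, the Pegasos-style subgradient training, and the six toy names. A real experiment would more likely use an established SVM library and a proper name corpus.

```python
from collections import Counter

def ngram_counts(name, n_max=3):
    """Count character n-grams of length 1..n_max.
    '^' and '$' are hypothetical boundary markers (an assumption)."""
    padded = "^" + name.lower() + "$"
    counts = Counter()
    for n in range(1, n_max + 1):
        for i in range(len(padded) - n + 1):
            counts[padded[i:i + n]] += 1
    return counts

def dot(w, feats):
    """Sparse dot product between a weight dict and a feature Counter."""
    return sum(w.get(f, 0.0) * v for f, v in feats.items())

def train_linear_svm(examples, labels, target, epochs=20, lam=0.01):
    """Binary linear SVM (target vs. rest) trained with Pegasos-style
    subgradient steps on the regularized hinge loss."""
    w = {}
    t = 0
    for _ in range(epochs):
        for feats, label in zip(examples, labels):
            t += 1
            eta = 1.0 / (lam * t)          # decaying step size
            y = 1.0 if label == target else -1.0
            margin = y * dot(w, feats)     # computed with the current weights
            scale = 1.0 - eta * lam        # shrinkage from the L2 regularizer
            for f in w:
                w[f] *= scale
            if margin < 1.0:               # hinge loss is active: move toward y
                for f, v in feats.items():
                    w[f] = w.get(f, 0.0) + eta * y * v
    return w

def train_one_vs_rest(names, labels):
    """Train one binary SVM per language."""
    examples = [ngram_counts(n) for n in names]
    return {lang: train_linear_svm(examples, labels, lang)
            for lang in sorted(set(labels))}

def predict(models, name):
    """Return the language whose SVM gives the highest score."""
    feats = ngram_counts(name)
    return max(models, key=lambda lang: dot(models[lang], feats))

# Toy, invented training data for illustration only (not from the paper).
names = ["tanaka", "suzuki", "yamamoto", "schmidt", "schneider", "fischer"]
labels = ["japanese", "japanese", "japanese", "german", "german", "german"]
models = train_one_vs_rest(names, labels)
print(predict(models, "tanabe"))
```

Because the boundary-marker unigrams occur exactly once in every name, they act as a constant feature, so the sketch needs no separate bias term. Counting n-grams of several lengths keeps short names informative, which is the setting where the abstract says length-dependent language models struggle.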

Aditya Bhargava, Grzegorz Kondrak

Character-level Language Models | Computational Linguistics | Language Identification | Language Models | NAACL 2010

Related Content

» Identification of related gene/protein names based on an HMM of name variations

» Finding Ideographic Representations of Japanese Names Written in Latin Script via Language...

» Chinese Named Entity Identification Using Class-based Language Model

» Proper Name Translation in Cross-Language Information Retrieval

» Generating a Morphological Lexicon of Organization Entity Names

» Robust Reading Identification and Tracing of Ambiguous Names

» Determining the Origin and Structure of Person Names

» Language Identification Strategies for Cross Language Information Retrieval

» The Problems of Language Identification within Hugely Multilingual Data Sets

Post Info

Added	14 Feb 2011
Updated	14 Feb 2011
Type	Journal
Year	2010
Where	NAACL
Authors	Aditya Bhargava, Grzegorz Kondrak
