CICLING
2005
Springer

Name Discrimination by Clustering Similar Contexts

Download www.d.umn.edu

It is relatively common for different people or organizations to share the same name. Given the increasing amount of information available online, this results in the ever-growing possibility of finding misleading or incorrect information due to confusion caused by an ambiguous name. This paper presents an unsupervised approach that resolves name ambiguity by clustering the instances of a given name into groups, each of which is associated with a distinct underlying entity. The features we employ to represent the context of an ambiguous name are statistically significant bigrams that occur in the same context as the ambiguous name. From these features we create a co-occurrence matrix where the rows and columns represent the first and second words in bigrams, and the cells contain their log-likelihood scores. Then we represent each of the contexts in which an ambiguous name appears with a second order context vector. This is created by taking the average of the vectors from the ...
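The pipeline in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the significance threshold of 3.841 (the chi-square critical value at p = 0.05 with one degree of freedom), and the reading that the averaged vectors are the co-occurrence matrix rows of the words in a context are all assumptions made here, since the abstract is truncated at that point. The clustering step itself is not shown.

```python
import math
from collections import Counter, defaultdict


def log_likelihood(n11, n1p, np1, npp):
    """G^2 score of a bigram from its 2x2 contingency table.

    n11: count of (w1, w2); n1p: bigrams starting with w1;
    np1: bigrams ending with w2; npp: total number of bigrams.
    """
    observed = [n11, n1p - n11, np1 - n11, npp - n1p - np1 + n11]
    expected = [
        n1p * np1 / npp,
        n1p * (npp - np1) / npp,
        (npp - n1p) * np1 / npp,
        (npp - n1p) * (npp - np1) / npp,
    ]
    # Cells with an observed count of zero contribute nothing to the sum.
    return 2.0 * sum(
        o * math.log(o / e) for o, e in zip(observed, expected) if o > 0
    )


def cooccurrence_matrix(contexts, threshold=3.841):
    """Rows are first words, columns are second words, cells are G^2 scores.

    The threshold is an assumed cutoff for "statistically significant".
    """
    bigrams = Counter()
    for tokens in contexts:
        bigrams.update(zip(tokens, tokens[1:]))
    first, second = Counter(), Counter()
    for (w1, w2), count in bigrams.items():
        first[w1] += count
        second[w2] += count
    total = sum(bigrams.values())
    matrix = defaultdict(dict)
    for (w1, w2), count in bigrams.items():
        score = log_likelihood(count, first[w1], second[w2], total)
        if score >= threshold:
            matrix[w1][w2] = score
    return dict(matrix)


def second_order_vector(tokens, matrix, columns):
    """Average the matrix rows of the context words that have a row."""
    rows = [matrix[t] for t in tokens if t in matrix]
    if not rows:
        return [0.0] * len(columns)
    return [sum(r.get(c, 0.0) for r in rows) / len(rows) for c in columns]
```

Each context of the ambiguous name would be passed through `second_order_vector`, and the resulting vectors could then be handed to any clustering routine that accepts fixed-length numeric vectors.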

Ted Pedersen, Amruta Purandare, Anagha Kulkarni

Ambiguous Name | CICLING 2005 | Co-occurrence Matrix | Natural Language Processing | Order Context Vectors

Related Content

» Unsupervised Discrimination of Person Names in Web Contexts

» Discriminating Among Word Meanings by Identifying Similar Contexts

» Improved Unsupervised Name Discrimination with Very Wide Bigrams and Automatic Cluster Sto...

» The effect of different context representations on word sense discrimination in biomedical...

» Discriminating Among Word Senses Using McQuitty's Similarity Analysis

» Resolving Person Names in Web People Search

» SenseClusters Finding Clusters that Represent Word Senses

» Discovering Relations among Named Entities from Large Corpora

» Semantic Integration in Text: From Ambiguous Names to Identifiable Entities

Post Info

Added	26 Jun 2010
Updated	26 Jun 2010
Type	Conference
Year	2005
Where	CICLING
Authors	Ted Pedersen, Amruta Purandare, Anagha Kulkarni
