Sciweavers

SIGIR
2010
ACM

Incorporating global information into named entity recognition systems using relational context

13 years 2 months ago
Incorporating global information into named entity recognition systems using relational context
The state-of-the-art in Named Entity Recognition relies on a combination of local features of the text and global knowledge to determine the types of the recognized entities. This is problematic in some cases, resulting in entities being classified as belonging to the wrong type. We show that using global information about the corpus improves the accuracy of type identification. We explore the notion of a global domain frequency that relates relationidentifying terms with pairs of entity types which are used in that relation. We use this to identify entities whose types are not compatible with the terms they co-occur in the text. Our results on a large corpus of social media content allows the identification of mistyped entities with 70% accuracy. Categories and Subject Descriptors I.2.7 [Natural Language Processing]: Text analysis General Terms Experimentation, Performance
Yuval Merhav, Filipe de Sá Mesquita, Denils
Added 30 Jan 2011
Updated 30 Jan 2011
Type Journal
Year 2010
Where SIGIR
Authors Yuval Merhav, Filipe de Sá Mesquita, Denilson Barbosa, Wai Gen Yee, Ophir Frieder
Comments (0)