Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

23

EMNLP
2011

favoriteEmaildiscussreport

237views Natural Language Processing» more EMNLP 2011»

Named Entity Recognition in Tweets: An Experimental Study

12 years 4 months ago

Named Entity Recognition in Tweets: An Experimental Study

Download www.cs.washington.edu

People tweet more than 100 Million times daily, yielding a noisy, informal, but sometimes informative corpus of 140-character messages that mirrors the zeitgeist in an unprecedented manner. The performance of standard NLP tools is severely degraded on tweets. This paper addresses this issue by re-building the NLP pipeline beginning with part-of-speech tagging, through chunking, to named-entity recognition. Our novel T-NER system doubles F1 score compared with the Stanford NER system. T-NER leverages the redundancy inherent in tweets to achieve this performance, using LabeledLDA to exploit Freebase dictionaries as a source of distant supervision. LabeledLDA outperforms cotraining, increasing F1 by 25% over ten common entity types. Our NLP tools are available at: http:// github.com/aritter/twitter_nlp

Alan Ritter, Sam Clark, Mausam, Oren Etzioni

Real-time Traffic

EMNLP 2011 | Entity Recognition | Natural Language Processing | NLP Tools | Unprecedented Manner |

claim paper

Related Content

» TwiNER named entity recognition in targeted twitter stream

» Discovering users topics of interest on twitter a first look

» Focused named entity recognition using machine learning

» A scalable machinelearning approach for semistructured named entity recognition

» Efficient combined approach for named entity recognition in spoken language

» Named entity recognition in query

» Chinese Named Entity Recognition with Cascaded Hybrid Model

» Nested Named Entity Recognition in Historical Archive Text

» SVMBased Biological Named Entity Recognition Using Minimum EditDistance Feature Boosted by...

Post Info
More Details (n/a)

Added	20 Dec 2011
Updated	20 Dec 2011
Type	Journal
Year	2011
Where	EMNLP
Authors	Alan Ritter, Sam Clark, Mausam, Oren Etzioni

Comments (0)