Sciweavers

WWW
2011
ACM

Domain-independent entity extraction from web search query logs

12 years 11 months ago
Domain-independent entity extraction from web search query logs
Query logs of a Web search engine have been increasingly used as a vital source for data mining. This paper presents a study on largescale domain-independent entity extraction from search query logs. We present a completely unsupervised method to extract entities by applying pattern-based heuristics and statistical measures. We compare against existing techniques that use Web documents as well as search logs, and show that we improve over the state of the art. We also provide an in-depth qualitative analysis outlining differences and commonalities between these methods. Categories and Subject Descriptors I.2.6 [Artificial Intelligence]: Learning—knowledge acquisition General Terms Algorithms Keywords entity extraction, query logs, data mining
Alpa Jain, Marco Pennacchiotti
Added 15 May 2011
Updated 15 May 2011
Type Journal
Year 2011
Where WWW
Authors Alpa Jain, Marco Pennacchiotti
Comments (0)