Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

145

WWW
2011
ACM

favoriteEmaildiscussreport

283views Internet Technology» more WWW 2011»

Domain-independent entity extraction from web search query logs

14 years 8 months ago

Domain-independent entity extraction from web search query logs

Download www.www2011india.com

Query logs of a Web search engine have been increasingly used as a vital source for data mining. This paper presents a study on largescale domain-independent entity extraction from search query logs. We present a completely unsupervised method to extract entities by applying pattern-based heuristics and statistical measures. We compare against existing techniques that use Web documents as well as search logs, and show that we improve over the state of the art. We also provide an in-depth qualitative analysis outlining differences and commonalities between these methods. Categories and Subject Descriptors I.2.6 [Artiﬁcial Intelligence]: Learning—knowledge acquisition General Terms Algorithms Keywords entity extraction, query logs, data mining

Alpa Jain, Marco Pennacchiotti

Real-time Traffic

Data Mining | Entity Extraction | Internet Technology | Largescale Domain-independent Entity | WWW 2011 |

claim paper

Related Content

» Open Entity Extraction from Web Search Query Logs

» Automatic extraction of clickable structured web contents for name entity queries

» Identifying comparable entities on the web

» Towards The Web of Concepts Extracting Concepts from Large Datasets

» Extracting newsrelated queries from web query log

» Organizing and searching the world wide web of facts step two harnessing the wisdom of th...

» Investigating the Semantic Gap through Query Log Analysis

» Query Recommendation Using LargeScale Web Access Logs and Web Page Archive

» Automatically Harvesting KatakanaEnglish Term Pairs from Search Engine Query Logs

Post Info
More Details (n/a)

Added	15 May 2011
Updated	15 May 2011
Type	Journal
Year	2011
Where	WWW
Authors	Alpa Jain, Marco Pennacchiotti

Comments (0)