Sciweavers

478 search results - page 29 / 96
» Extracting news-related queries from web query log
Sort
View
EMNLP
2009
14 years 7 months ago
Using Word-Sense Disambiguation Methods to Classify Web Queries by Intent
Three methods are proposed to classify queries by intent (CQI), e.g., navigational, informational, commercial, etc. Following mixed-initiative dialog systems, search engines shoul...
Emily Pitler, Ken Ward Church
WWW
2007
ACM
15 years 10 months ago
EPCI: extracting potentially copyright infringement texts from the web
In this paper, we propose a new system extracting potentially copyright infringement texts from the Web, called EPCI. EPCI extracts them in the following way: (1) generating a set...
Takashi Tashiro, Takanori Ueda, Taisuke Hori, Yu H...
IJSI
2008
115views more  IJSI 2008»
14 years 9 months ago
Towards Knowledge Acquisition from Semi-Structured Content
Abstract A rich family of generic Information Extraction (IE) techniques have been developed by researchers nowadays. This paper proposes WebKER, a system for automatically extract...
Xi Bai, Jigui Sun, Haiyan Che, Lian Shi
SIGMOD
2006
ACM
162views Database» more  SIGMOD 2006»
15 years 10 months ago
Efficient query processing in geographic web search engines
Geographic web search engines allow users to constrain and order search results in an intuitive manner by focusing a query on a particular geographic region. Geographic search tec...
Yen-Yu Chen, Torsten Suel, Alexander Markowetz
ICDE
2004
IEEE
117views Database» more  ICDE 2004»
15 years 11 months ago
Probe, Cluster, and Discover: Focused Extraction of QA-Pagelets from the Deep Web
In this paper, we introduce the concept of a QA-Pagelet to refer to the content region in a dynamic page that contains query matches. We present THOR, a scalable and efficient min...
James Caverlee, Ling Liu, David Buttler