Sciweavers

» Using Keyword Extraction for Web Site Clustering
CIKM
2008
Springer
14 years 11 months ago
CoreEx: content extraction from online news articles
We developed and tested a heuristic technique for extracting the main article from news site Web pages. We construct the DOM tree of the page and score every node based on the amo...
Jyotika Prasad, Andreas Paepcke
CL
2000
Springer
15 years 2 months ago
Design and Implementation of the Physical Layer in WebBases: The XRover Experience
Webbases are database systems that enable creation of Web applications that allow end users to shop around for products and services at various Web sites without having to manually...
Hasan Davulcu, Guizhen Yang, Michael Kifer, I. V. ...
COLING
2010
14 years 4 months ago
Open Entity Extraction from Web Search Query Logs
In this paper we propose a completely unsupervised method for open-domain entity extraction and clustering over query logs. The underlying hypothesis is that classes defined by mi...
Alpa Jain, Marco Pennacchiotti
TREC
2001
14 years 11 months ago
SiteQ: Engineering High Performance QA System Using Lexico-Semantic Pattern Matching and Shallow NLP
In TREC-10, we participated in the web track (only ad-hoc task) and the QA track (only main task). In the QA track, our QA system (SiteQ) has general architecture with three proc...
Gary Geunbae Lee, Jungyun Seo, Seungwoo Lee, Hanmi...
KES
2008
Springer
14 years 9 months ago
Data Mining for Navigation Generating System with Unorganized Web Resources
Users prefer to navigate subjects from organized topics in an abundance of resources rather than to list pages retrieved from search engines. We propose a framework to cluster frequent items...
Diana Purwitasari, Yasuhisa Okazaki, Kenzi Watanab...