Sciweavers

3069 search results
» Thinking about Technology
APWEB
2004
Springer
15 years 10 months ago
A Query-Dependent Duplicate Detection Approach for Large Scale Search Engines
Duplication of Web pages greatly hurts the perceived relevance of a search engine. Existing methods for detecting duplicated Web pages can be classified into two categories, i.e. o...
Shaozhi Ye, Ruihua Song, Ji-Rong Wen, Wei-Ying Ma
AIRWEB
2006
Springer
15 years 10 months ago
Tracking Web Spam with Hidden Style Similarity
Automatically generated content is ubiquitous in the web: dynamic sites built using the three-tier paradigm are good examples (e.g. commercial sites, blogs and other sites powered...
Tanguy Urvoy, Thomas Lavergne, Pascal Filoche
APWEB
2006
Springer
15 years 10 months ago
Automatically Constructing Descriptive Site Maps
Rapid increase in the number of pages on web sites, and widespread use of search engine optimization techniques, lead to web sites becoming difficult to navigate. Traditional site ...
Pavel Dmitriev, Carl Lagoze
APWEB
2006
Springer
15 years 10 months ago
Sample Sizes for Query Probing in Uncooperative Distributed Information Retrieval
The goal of distributed information retrieval is to support effective searching over multiple document collections. For efficiency, queries should be routed to only those collectio...
Milad Shokouhi, Falk Scholer, Justin Zobel
ESWS
2006
Springer
15 years 10 months ago
PowerAqua: Fishing the Semantic Web
The Semantic Web (SW) offers an opportunity to develop novel, sophisticated forms of question answering (QA). Specifically, the availability of distributed semantic markup on a lar...
Vanessa Lopez, Enrico Motta, Victoria S. Uren