Sciweavers

219 search results - page 2 / 44
» Web page language identification based on URLs
Sort
View
CN
2006
78views more  CN 2006»
13 years 5 months ago
A short walk in the Blogistan
The increasingly prominent new subset of Web pages, called `blogs' differs from traditional Web pages both in characteristics and potential to applications. We explore three ...
Edith Cohen, Balachander Krishnamurthy
ICDE
2005
IEEE
126views Database» more  ICDE 2005»
13 years 11 months ago
WEBVIGIL: Monitoring Multiple Web Pages and Presentation of XML Pages
In the case of large-scale distributed environments such as the Internet, users are interested in monitoring changes to a particular web page (XML or HTML). There are many instanc...
Shravan Chamakura, Alpa Sachde, Sharma Chakravarth...
SIGIR
2002
ACM
13 years 4 months ago
The Importance of Prior Probabilities for Entry Page Search
An important class of searches on the world-wide-web has the goal to find an entry page (homepage) of an organisation. Entry page search is quite different from Ad Hoc search. Ind...
Wessel Kraaij, Thijs Westerveld, Djoerd Hiemstra
TREC
2004
13 years 6 months ago
Language Models for Searching in Web Corpora
: We describe our participation in the TREC 2004 Web and Terabyte tracks. For the web track, we employ mixture language models based on document full-text, incoming anchortext, and...
Jaap Kamps, Gilad Mishne, Maarten de Rijke
WWW
2001
ACM
14 years 6 months ago
Finding Related Web Pages Based on Connectivity Information from a Search Engine
This paper proposes a method for finding related Web pages based on connectivity information of hyperlinks. As claimed by Kumar, a complete bipartite graph of Web pages can be reg...
Tsuyoshi Murata