Sciweavers

» Language Identification of Search Engine Queries
WWW
2010
ACM
15 years 4 months ago
Not so creepy crawler: easy crawler generation with standard XML queries
Web crawlers are increasingly used for focused tasks such as the extraction of data from Wikipedia or the analysis of social networks like last.fm. In these cases, pages are far m...
Franziska von dem Bussche, Klara A. Weiand, Benedi...
CIDR
2009
14 years 11 months ago
Extracting and Querying a Comprehensive Web Database
Recent research in domain-independent information extraction holds the promise of an automatically-constructed structured database derived from the Web. A query system based on th...
Michael J. Cafarella
ICMI
2005
Springer
15 years 3 months ago
Augmenting conversational dialogue by means of latent semantic googling
This paper presents Latent Semantic Googling, a variant of Landauer’s Latent Semantic Indexing that uses the Google search engine to judge the semantic closeness of sets of word...
Robin Senior, Roel Vertegaal
PVLDB
2008
14 years 9 months ago
Google's Deep Web crawl
The Deep Web, i.e., content hidden behind HTML forms, has long been acknowledged as a significant gap in search engine coverage. Since it represents a large portion of the structu...
Jayant Madhavan, David Ko, Lucja Kot, Vignesh Gana...
DAGSTUHL
2007
14 years 11 months ago
Calculus and Algebra for Distributed Data Management
The sharing of content by communities of users (e.g., scientists) in a P2P context remains cumbersome. We argue that the main reason for this is the lack of calculus and alg...
Serge Abiteboul