Sciweavers

114 search results - page 20 / 23
» Estimating the impressionrank of web pages
Sort
View
WWW
2008
ACM
14 years 6 months ago
Modeling anchor text and classifying queries to enhance web document retrieval
Several types of queries are widely used on the World Wide Web and the expected retrieval method can vary depending on the query type. We propose a method for classifying queries ...
Atsushi Fujii
SIGMOD
2009
ACM
140views Database» more  SIGMOD 2009»
14 years 26 days ago
Robust web extraction: an approach based on a probabilistic tree-edit model
On script-generated web sites, many documents share common HTML tree structure, allowing wrappers to effectively extract information of interest. Of course, the scripts and thus ...
Nilesh N. Dalvi, Philip Bohannon, Fei Sha
SIGIR
2005
ACM
13 years 11 months ago
Improving collection selection with overlap awareness in P2P search engines
Collection selection has been a research issue for years. Typically, in related work, precomputed statistics are employed in order to estimate the expected result quality of each ...
Matthias Bender, Sebastian Michel, Peter Triantafi...
PRL
2010
149views more  PRL 2010»
13 years 25 days ago
Adaptive linear models for regression: Improving prediction when population has changed
The general setting of regression analysis is to identify a relationship between a response variable Y and one or several explanatory variables X by using a learning sample. In a ...
Charles Bouveyron, Julien Jacques
ECIR
2009
Springer
14 years 3 months ago
Correlation of Term Count and Document Frequency for Google N-Grams
For bounded datasets such as the TREC Web Track (WT10g) the computation of term frequency (TF) and inverse document frequency (IDF) is not difficult. However, when the corpus is th...
Martin Klein, Michael L. Nelson