Sciweavers

10 search results - page 2 / 2
» DeepBot: a focused crawler for accessing hidden web content
Sort
View
JCDL
2010
ACM
188views Education» more  JCDL 2010»
13 years 10 months ago
Exposing the hidden web for chemical digital libraries
In recent years, the vast amount of digitally available content has lead to the creation of many topic-centered digital libraries. Also in the domain of chemistry more and more di...
Sascha Tönnies, Benjamin Köhncke, Oliver...
WWW
2009
ACM
14 years 5 months ago
Purely URL-based topic classification
Given only the URL of a web page, can we identify its topic? This is the question that we examine in this paper. Usually, web pages are classified using their content [7], but a U...
Eda Baykan, Monika Rauch Henzinger, Ludmila Marian...
SIGIR
2011
ACM
12 years 7 months ago
Learning search tasks in queries and web pages via graph regularization
As the Internet grows explosively, search engines play a more and more important role for users in effectively accessing online information. Recently, it has been recognized that ...
Ming Ji, Jun Yan, Siyu Gu, Jiawei Han, Xiaofei He,...
AAAI
2008
13 years 7 months ago
Extracting Relevant Snippets for Web Navigation
Search engines present fix-length passages from documents ranked by relevance against the query. In this paper, we present and compare novel, language-model based methods for extr...
Qing Li, K. Selçuk Candan, Qi Yan
BMCBI
2008
121views more  BMCBI 2008»
13 years 5 months ago
ReAlignerV: Web-based genomic alignment tool with high specificity and robustness estimated by species-specific insertion sequen
Background: Detecting conserved noncoding sequences (CNSs) across species highlights the functional elements. Alignment procedures combined with computational prediction of transc...
Hisakazu Iwama, Yukio Hori, Kensuke Matsumoto, Koj...