Sciweavers

1098 search results - page 95 / 220
» Compressed web indexes
Sort
View
ECIR
2011
Springer
14 years 1 months ago
SkipBlock: Self-indexing for Block-Based Inverted List
In large web search engines the performance of Information Retrieval systems is a key issue. Block-based compression methods are often used to improve the search performance, but c...
Stéphane Campinas, Renaud Delbru, Giovanni ...
BIRTHDAY
2010
Springer
15 years 4 months ago
Extended Compact Web Graph Representations
Abstract. Many relevant Web mining tasks translate into classical algorithms on the Web graph. Compact Web graph representations allow running these tasks on larger graphs within m...
Francisco Claude, Gonzalo Navarro
PVLDB
2008
124views more  PVLDB 2008»
14 years 9 months ago
Google's Deep Web crawl
The Deep Web, i.e., content hidden behind HTML forms, has long been acknowledged as a significant gap in search engine coverage. Since it represents a large portion of the structu...
Jayant Madhavan, David Ko, Lucja Kot, Vignesh Gana...
JCDL
2006
ACM
167views Education» more  JCDL 2006»
15 years 4 months ago
Combining DOM tree and geometric layout analysis for online medical journal article segmentation
We describe an HTML web page segmentation algorithm, which is applied to segment online medical journal articles (regular HTML and PDF-Converted-HTML files). The web page content ...
Jie Zou, Daniel X. Le, George R. Thoma
WWW
2009
ACM
15 years 10 months ago
Large scale integration of senses for the semantic web
Nowadays, the increasing amount of semantic data available on the Web leads to a new stage in the potential of Semantic Web applications. However, it also introduces new issues du...
Jorge Gracia, Mathieu d'Aquin, Eduardo Mena