Sciweavers

1109 search results - page 40 / 222
» Crawling on web graphs
Sort
View
SPIRE
1999
Springer
15 years 6 months ago
CoBWeb - A Crawler for the Brazilian Web
One of the key components of current Web search engines is the document collector. This paper describes CoBWeb, an automatic document collector, whose architecture is distributed ...
Altigran Soares da Silva, Eveline A. Veloso, Paulo...
EACL
2006
ACL Anthology
15 years 3 months ago
Large Linguistically-Processed Web Corpora for Multiple Languages
The Web contains vast amounts of linguistic data. One key issue for linguists and language technologists is how to access it. Commercial search engines give highly compromised acc...
Marco Baroni, Adam Kilgarriff
133
Voted
SIGIR
2012
ACM
13 years 4 months ago
Creating temporally dynamic web search snippets
Content on the Internet is always changing. We explore the value of biasing search result snippets towards new webpage content. We present results from a user study comparing trad...
Krysta Marie Svore, Jaime Teevan, Susan T. Dumais,...
WWW
2003
ACM
15 years 7 months ago
AnswerBus News Engine
AnswerBus News Engine1 is a question answering system using the contents of CNN Web site2 as its knowledge base. Comparing to other question answering systems including its previo...
Zhiping Zheng
110
Voted
CLEF
2010
Springer
15 years 3 months ago
MapReduce for Information Retrieval Evaluation: "Let's Quickly Test This on 12 TB of Data"
We propose to use MapReduce to quickly test new retrieval approaches on a cluster of machines by sequentially scanning all documents. We present a small case study in which we use ...
Djoerd Hiemstra, Claudia Hauff