Sciweavers

4313 search results - page 191 / 863
» Information Retrieval and the Semantic Web
Sort
View
SIGMOD
1998
ACM
143views Database» more  SIGMOD 1998»
15 years 3 months ago
Interaction of Query Evaluation and Buffer Management for Information Retrieval
The proliferation of the World Wide Web has brought information retrieval (IR) techniques to the forefront of search technology. To the average computer user, “searching” now ...
Björn Þór Jónsson, Michae...
67
Voted
EDBT
2006
ACM
112views Database» more  EDBT 2006»
15 years 11 months ago
Indexing Shared Content in Information Retrieval Systems
Abstract. Modern document collections often contain groups of documents with overlapping or shared content. However, most information retrieval systems process each document separa...
Andrei Z. Broder, Nadav Eiron, Marcus Fontoura, Mi...
CIKM
2003
Springer
15 years 4 months ago
Extracting unstructured data from template generated web documents
We propose a novel approach that identifies web page templates and extracts the unstructured data. Extracting only the body of the page and eliminating the template increases the ...
Ling Ma, Nazli Goharian, Abdur Chowdhury, Misun Ch...
79
Voted
WWW
2008
ACM
15 years 11 months ago
Recrawl scheduling based on information longevity
It is crucial for a web crawler to distinguish between ephemeral and persistent content. Ephemeral content (e.g., quote of the day) is usually not worth crawling, because by the t...
Christopher Olston, Sandeep Pandey
83
Voted
LREC
2008
139views Education» more  LREC 2008»
15 years 10 days ago
Experiments to Investigate the Connection between Case Distribution and Topical Relevance of Search Terms in an Information Retr
We have performed a set of experiments made to investigate the utility of morphological analysis to improve retrieval of documents written in languages with relatively large morph...
Jussi Karlgren, Hercules Dalianis, Bart Jongejan