Sciweavers

938 search results - page 85 / 188
» Space-Efficient Algorithms for Document Retrieval
Sort
View
LREC
2010
189views Education» more  LREC 2010»
15 years 1 months ago
Automatic Acquisition of Parallel Corpora from Websites with Dynamic Content
Parallel corpora are indispensable resources for a variety of multilingual natural language processing tasks. This paper presents a technique for fully automatic construction of c...
Yulia Tsvetkov, Shuly Wintner
SIGIR
2009
ACM
15 years 6 months ago
Approximating true relevance distribution from a mixture model based on irrelevance data
Pseudo relevance feedback (PRF), which has been widely applied in IR, aims to derive a distribution from the top n pseudo relevant documents D. However, these documents are often ...
Peng Zhang, Yuexian Hou, Dawei Song
CIKM
2007
Springer
15 years 6 months ago
Query expansion using probabilistic local feedback with application to multimedia retrieval
As one of the most effective query expansion approaches, local feedback is able to automatically discover new query terms and improve retrieval accuracy for different retrieval ...
Rong Yan, Alexander G. Hauptmann
SIGIR
2010
ACM
14 years 6 months ago
Efficient partial-duplicate detection based on sequence matching
With the ever-increasing growth of the Internet, numerous copies of documents become serious problem for search engine, opinion mining and many other web applications. Since parti...
Qi Zhang, Yue Zhang, Haomin Yu, Xuanjing Huang
VLDB
2002
ACM
97views Database» more  VLDB 2002»
14 years 11 months ago
Incorporating XSL Processing into Database Engines
The two observations that 1) many XML documents are stored in a database or generated from data stored in a database and 2) processing these documents with XSL stylesheet processo...
Guido Moerkotte