Sciweavers

IMCSIT
2010

Quality Benchmarking Relational Databases and Lucene in the TREC4 Adhoc Task Environment

13 years 2 months ago
Quality Benchmarking Relational Databases and Lucene in the TREC4 Adhoc Task Environment
The present work covers a comparison of the text retrieval qualities of open source relational databases and Lucene, which is a full text search engine library, over English documents. TREC-4 adhoc task is completed to compare both search effectiveness and search efficiency. Two relational database management systems and four different well-known English stemming algorithms have been tried. It has been found that language specific preprocessing improves retrieval quality for all systems. The results of the English text retrieval experiments by using Lucene are at par with top six results presented at TREC4 automatic adhoc. Although open source relational databases integrated full text retrieval technology, their relevancy ranking mechanisms are not as good as Lucene's.
Ahmet Arslan, Ozgur Yilmazel
Added 13 Feb 2011
Updated 13 Feb 2011
Type Journal
Year 2010
Where IMCSIT
Authors Ahmet Arslan, Ozgur Yilmazel
Comments (0)