A COMPARATIVE STUDY OF INFORMATION RETRIEVAL MODELS: BM25 VERSUS HYBRID RETRIEVAL ON THE CRANFIELD COLLECTION
Authors/Creators
Description
Information retrieval systems face the fundamental challenge of balancing retrieval effectiveness with computational efficiency. Traditional lexical modelslike BM25 provide fast retrieval but may misssemantic relationships,
while modern hybrid approaches combining lexical and semantic methods promise improved effectiveness at the
cost of computational complexity. This study compares the performance of BM25 and hybrid retrieval models
on the Cranfield collection, evaluating both retrieval effectiveness and computational efficiency. We implemented
two retrieval systems: (1) BM25 with pseudo-relevance feedback and tuned parameters (k1 = 1.6, b = 0.4), and
(2) a hybrid model combining BM25, dense retrieval using SPECTER embeddings, Reciprocal Rank Fusion, and
cross-encoder reranking. Both systems were evaluated on 225 Cranfield queries using standard IR metrics:
Precision@10, Recall@20, Mean Average Precision (MAP), and NDCG@20. The hybrid model demonstrated
superior retrieval effectiveness with 41% improvement in Precision@10 (1.07% vs 0.76%), 67% improvement
in Recall@20 (1.54% vs 0.92%), and consistent gains across all metrics. However, BM25 showed dramatically
superior efficiency with 800x faster query processing (15.3ms vs 12.4s average latency). While hybrid retrieval
models achieve better effectiveness, the computational cost may limit their applicability to real-time scenarios.
The Cranfield collection’s inherent difficulty (1960s terminology, sparse relevance judgments) constrains absolute
performance regardless of model sophistication.
Files
JAN19.pdf
Files
(336.3 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:b9fee9a986f5a85c251c1ac55f39bfc4
|
336.3 kB | Preview Download |