Selective Gradient Boosting for Effective Learning to Rank

doi:10.5281/zenodo.2668014

Published July 12, 2018 | Version v1

Conference paper Open

Selective Gradient Boosting for Effective Learning to Rank

1. UNIVE
2. CNR

Learning an effective ranking function from a large number of query-document examples is a challenging task. Indeed, training sets where queries are associated with a few relevant documents and a large number of irrelevant ones are required to model real scenarios of Web search production systems, where a query can possibly retrieve thousands of matching documents, but only a few of them are actually relevant. In this paper, we propose Selective Gradient Boosting (SelGB), an algorithm addressing the Learning-to-Rank task by focusing on those irrelevant documents that are most likely to be mis-ranked, thus severely hindering the quality of the learned model. SelGB exploits a novel technique minimizing the mis-ranking risk, i.e., the probability that two randomly drawn instances are ranked incorrectly, within a gradient boosting process that iteratively generates an additive ensemble of decision trees. Specifically, at every iteration and on a per query basis, SelGB selectively chooses among the training instances a small sample of negative examples enhancing the discriminative power of the learned model. Reproducible and comprehensive experiments conducted on a publicly available dataset show that SelGB exploits the diversity and variety of the negative examples selected to train tree ensembles that outperform models generated by state-of-the-art algorithms by achieving improvements of NDCG@10 up to 3.2%.

Files

SelectiveGradientBoostingForEffectiveLearningToRank.pdf

Files (1.2 MB)

Name	Size	Download all
SelectiveGradientBoostingForEffectiveLearningToRank.pdf md5:b48c5c4ebf1ed062a7cb0bc2b66511b5	1.2 MB	Preview Download

Additional details

BigDataGrapes – Big Data to Enable Global Disruption of the Grapevine-powered Industries 780751: European Commission

	All versions	This version
Views	89	87
Downloads	150	150
Data volume	189.0 MB	189.0 MB

Selective Gradient Boosting for Effective Learning to Rank

Creators

Description

Files

SelectiveGradientBoostingForEffectiveLearningToRank.pdf

Files (1.2 MB)

Additional details

Funding