A Boosting Algorithm for Learning Bipartite Ranking Functions with Partially Labeled Data - Université Pierre et Marie Curie
Conference paper, Year: 2008

A Boosting Algorithm for Learning Bipartite Ranking Functions with Partially Labeled Data

Cyril Goutte
  • Role: Author

Abstract

This paper presents a boosting-based algorithm for learning a bipartite ranking function (BRF) from partially labeled data. Previous attempts to build a BRF from such data have worked in a transductive setting, in which the test points are given to the method in advance as unlabeled data. The proposed approach is a semi-supervised inductive ranking algorithm which, unlike transductive algorithms, can infer an ordering on new examples that were not used for training. We evaluate our approach on the TREC-9 Ohsumed and Reuters-21578 data collections, comparing it against two semi-supervised classification algorithms in terms of area under the ROC curve (AUC), uninterpolated average precision (AUP), mean precision@50 (TP), and precision-recall (PR) curves. In the most interesting cases, where irrelevant examples outnumber relevant ones, our method yields statistically significant improvements on these ranking measures.
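The ranking measures named in the abstract can be illustrated with a small sketch. The code below uses the textbook definitions of AUC, uninterpolated average precision, and precision@k for a bipartite (relevant vs. irrelevant) ranking; the function names and the toy data are assumptions made for illustration, and this is not the evaluation code used by the authors.

```python
from typing import List, Sequence


def ranked_labels(scores: Sequence[float], labels: Sequence[int]) -> List[int]:
    """Return the 0/1 labels sorted by decreasing score (stable for ties)."""
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    return [labels[i] for i in order]


def auc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """Fraction of (relevant, irrelevant) pairs ordered correctly; ties count 1/2."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one relevant and one irrelevant example")
    correct = sum(
        1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg
    )
    return correct / (len(pos) * len(neg))


def average_precision(scores: Sequence[float], labels: Sequence[int]) -> float:
    """Mean of the precision values measured at the rank of each relevant example."""
    hits = 0
    total = 0.0
    for rank, y in enumerate(ranked_labels(scores, labels), start=1):
        if y == 1:
            hits += 1
            total += hits / rank
    if hits == 0:
        raise ValueError("average precision needs at least one relevant example")
    return total / hits


def precision_at_k(scores: Sequence[float], labels: Sequence[int], k: int) -> float:
    """Share of relevant examples among the k top-ranked ones (divides by k)."""
    if k <= 0:
        raise ValueError("k must be positive")
    return sum(ranked_labels(scores, labels)[:k]) / k


# Toy data (hypothetical): 2 relevant and 3 irrelevant examples.
toy_scores = [0.9, 0.8, 0.7, 0.6, 0.5]
toy_labels = [1, 0, 1, 0, 0]
print(auc(toy_scores, toy_labels))
print(average_precision(toy_scores, toy_labels))
print(precision_at_k(toy_scores, toy_labels, k=2))
```

With unbalanced data, AUC weights every relevant/irrelevant pair equally, whereas average precision and precision@k emphasize the top of the ranking, which is one reason a paper may report all of them side by side.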

Dates and versions

hal-01301620 , version 1 (12-04-2016)

Identifiers

HAL Id: hal-01301620
DOI: 10.1145/1390334.1390354

Cite

Massih-Reza Amini, Tuong Vinh Truong, Cyril Goutte. A Boosting Algorithm for Learning Bipartite Ranking Functions with Partially Labeled Data. 31st International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2008), Jul 2008, Singapore, Singapore. pp.99-106, ⟨10.1145/1390334.1390354⟩. ⟨hal-01301620⟩