The influence of feature selection methods on accuracy, stability and interpretability of molecular signatures - Institut Curie Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2010

The influence of feature selection methods on accuracy, stability and interpretability of molecular signatures

Résumé

Motivation : Biomarker discovery from high-dimensional data is a crucial problem with enormous applications in biology and medicine, such as, e.g. breast cancer prognosis. While it is now common belief that feature selection methods should output both accurate and stable signatures - at least at the functional level -, no study has focused on comparing algorithms in these regards. Methods : Borrowing 4 public datasets, we define systematic procedures to compare a representative panel of 8 feature selection algorithms, among which filters, wrappers and embedded methods in light of predictive performance, stability and functional interpretability of the signatures that they output. We also implemented Ensemble methods and propose to estimate the advantages of using them. Results : We observe that ensemble feature selection techniques have generally no substantial impact on accuracy or stability. Additionally we notice a possible trade-off between stability and accuracy as some methods produce predictive but unstable signatures while others behave in the opposite way. Filter feature selection by a Student's t-test seems to provide a good accuracy/stability trade-off overall.
Fichier principal
Vignette du fichier
techreport2010.pdf (1.65 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-00559580 , version 1 (25-01-2011)
hal-00559580 , version 2 (23-06-2011)

Identifiants

Citer

Anne-Claire Haury, Pierre Gestraud, Jean-Philippe Vert. The influence of feature selection methods on accuracy, stability and interpretability of molecular signatures. 2010. ⟨hal-00559580v1⟩
444 Consultations
1021 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More