Theoretical and empirical study on the potential inadequacy of mutual information for feature selection in classification

Benoît Frénay; Gauthier Doquire; Michel Verleysen

doi:10.1016/j.neucom.2012.12.051

Research output: Contribution to journal › Article › Peer-reviewed

Abstract

Mutual information is a widely used performance criterion for filter feature selection. However, despite its popularity and its appealing properties, mutual information is not always the most appropriate criterion. Indeed, contrary to what is sometimes hypothesized in the literature, looking for a feature subset that maximizes the mutual information does not always guarantee a decrease in the misclassification probability, which is often the objective one is interested in. The first objective of this paper is thus to clearly illustrate this potential inadequacy and to emphasize the fact that the mutual information remains a heuristic that comes with no guarantee in terms of classification accuracy. Through extensive experiments, a deeper analysis of the cases in which the mutual information is not a suitable criterion is then conducted. This analysis allows us to confirm the general interest of the mutual information for feature selection. It also helps us to better understand the behaviour of mutual information throughout a feature selection process and consequently to make better use of it as a feature selection criterion.
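
The central claim of the abstract, that a higher mutual information does not necessarily mean a lower misclassification probability, can be illustrated with a small numerical sketch. The two toy features below (a noisy copy of the class and an "erasure" feature) and the helper names mutual_information and bayes_error are assumptions made for illustration only; they are not taken from the paper and do not reproduce its experiments.

```python
import math

def mutual_information(joint):
    """Mutual information I(X;Y) in bits for a joint table joint[x][y]."""
    px = [sum(row) for row in joint]        # marginal of the feature X
    py = [sum(col) for col in zip(*joint)]  # marginal of the class Y
    mi = 0.0
    for x, row in enumerate(joint):
        for y, pxy in enumerate(row):
            if pxy > 0:                     # 0 * log(0) is taken as 0
                mi += pxy * math.log2(pxy / (px[x] * py[y]))
    return mi

def bayes_error(joint):
    """Misclassification probability of the optimal (Bayes) classifier."""
    # For each feature value, the best guess is the most probable class.
    return 1.0 - sum(max(row) for row in joint)

# Hypothetical feature A: a copy of the binary class, flipped 15% of the time.
noisy = [[0.425, 0.075],
         [0.075, 0.425]]

# Hypothetical feature B: reveals the class half of the time,
# and otherwise takes an uninformative third value.
erasure = [[0.25, 0.00],
           [0.00, 0.25],
           [0.25, 0.25]]

for name, joint in [("noisy copy", noisy), ("erasure", erasure)]:
    print(f"{name}: MI = {mutual_information(joint):.3f} bits, "
          f"Bayes error = {bayes_error(joint):.3f}")
```

In this toy setting the erasure feature has the larger mutual information (0.5 bits against roughly 0.39 bits) and yet the larger Bayes error (0.25 against 0.15), so a criterion that ranks single features by mutual information alone would prefer the feature that classifies worse.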

Original language	English
Pages (from-to)	64-78
Number of pages	15
Journal	Neurocomputing
Volume	112
DOIs	https://doi.org/10.1016/j.neucom.2012.12.051
Publication status	Published - 18 Jul 2013
Externally published	Yes

Access to document

10.1016/j.neucom.2012.12.051

Theoretical and empirical study on the potential inadequacy of mutual information for feature selection in classification (submitted manuscript, 1.47 MB)

http://ac.els-cdn.com/S0925231213002415/1-s2.0-S0925231213002415-main.pdf?_tid=72006fc2-770c-11e5-9463-00000aab0f02&acdnat=1445333284_38fe1379f0cb28255e263bbea00a4b09

Other files and links

Link to publication in Scopus

Cite this

@article{8fc9ae7e8cfc492899859486521886a8,

title = "Theoretical and empirical study on the potential inadequacy of mutual information for feature selection in classification",

abstract = "Mutual information is a widely used performance criterion for filter feature selection. However, despite its popularity and its appealing properties, mutual information is not always the most appropriate criterion. Indeed, contrary to what is sometimes hypothesized in the literature, looking for a feature subset maximizing the mutual information does not always guarantee to decrease the misclassification probability, which is often the objective one is interested in. The first objective of this paper is thus to clearly illustrate this potential inadequacy and to emphasize the fact that the mutual information remains a heuristic, coming with no guarantee in terms of classification accuracy. Through extensive experiments, a deeper analysis of the cases for which the mutual information is not a suitable criterion is then conducted. This analysis allows us to confirm the general interest of the mutual information for feature selection. It also helps us better apprehending the behaviour of mutual information throughout a feature selection process and consequently making a better use of it as a feature selection criterion.",

keywords = "Classification, Feature selection, Hellman-Raviv and Fano bounds, Mutual information, Probability of misclassification",

author = "Beno{\^i}t Fr{\'e}nay and Gauthier Doquire and Michel Verleysen",

year = "2013",

month = jul,

day = "18",

doi = "10.1016/j.neucom.2012.12.051",

language = "English",

volume = "112",

pages = "64--78",

journal = "Neurocomputing",

issn = "0925-2312",

publisher = "Elsevier",

}

TY - JOUR

T1 - Theoretical and empirical study on the potential inadequacy of mutual information for feature selection in classification

AU - Frénay, Benoît

AU - Doquire, Gauthier

AU - Verleysen, Michel

PY - 2013/7/18

Y1 - 2013/7/18

N2 - Mutual information is a widely used performance criterion for filter feature selection. However, despite its popularity and its appealing properties, mutual information is not always the most appropriate criterion. Indeed, contrary to what is sometimes hypothesized in the literature, looking for a feature subset maximizing the mutual information does not always guarantee to decrease the misclassification probability, which is often the objective one is interested in. The first objective of this paper is thus to clearly illustrate this potential inadequacy and to emphasize the fact that the mutual information remains a heuristic, coming with no guarantee in terms of classification accuracy. Through extensive experiments, a deeper analysis of the cases for which the mutual information is not a suitable criterion is then conducted. This analysis allows us to confirm the general interest of the mutual information for feature selection. It also helps us better apprehending the behaviour of mutual information throughout a feature selection process and consequently making a better use of it as a feature selection criterion.

AB - Mutual information is a widely used performance criterion for filter feature selection. However, despite its popularity and its appealing properties, mutual information is not always the most appropriate criterion. Indeed, contrary to what is sometimes hypothesized in the literature, looking for a feature subset maximizing the mutual information does not always guarantee to decrease the misclassification probability, which is often the objective one is interested in. The first objective of this paper is thus to clearly illustrate this potential inadequacy and to emphasize the fact that the mutual information remains a heuristic, coming with no guarantee in terms of classification accuracy. Through extensive experiments, a deeper analysis of the cases for which the mutual information is not a suitable criterion is then conducted. This analysis allows us to confirm the general interest of the mutual information for feature selection. It also helps us better apprehending the behaviour of mutual information throughout a feature selection process and consequently making a better use of it as a feature selection criterion.

KW - Classification

KW - Feature selection

KW - Hellman-Raviv and Fano bounds

KW - Mutual information

KW - Probability of misclassification

UR - http://www.scopus.com/inward/record.url?scp=84877634882&partnerID=8YFLogxK

U2 - 10.1016/j.neucom.2012.12.051

DO - 10.1016/j.neucom.2012.12.051

M3 - Article

SN - 0925-2312

VL - 112

SP - 64

EP - 78

JO - Neurocomputing

JF - Neurocomputing

ER -