Smoothness bias in relevance estimators for feature selection in regression

Alexandra Degeest, Michel Verleysen, Benoît Frénay

Research output: Contribution in Book/Catalog/Report/Conference proceeding › Conference contribution

Abstract

Selecting features from high-dimensional datasets is an important problem in machine learning. This paper shows that, in the context of filter methods for feature selection, the estimator of the criterion used to select features plays an important role: in particular, these estimators may suffer from a bias when comparing smooth and non-smooth features. The paper analyses the origin of this bias and investigates whether it influences the results of the feature selection process. Results show that non-smooth features tend to be penalised, especially in small datasets.
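To illustrate the kind of bias discussed in the abstract, the sketch below scores a smooth and a non-smooth feature with a k-nearest-neighbour mutual information estimator (one of the criteria mentioned in the keywords) for several sample sizes. This is a minimal illustration, not the paper's experimental setup; the generating functions, noise level, sample sizes and use of scikit-learn's `mutual_info_regression` are assumptions chosen for the example.

```python
# Minimal sketch (assumed setup, not the authors' experiments): a k-NN based
# mutual information estimator can favour a smooth feature over an equally
# relevant non-smooth one when the sample is small.
import numpy as np
from sklearn.feature_selection import mutual_info_regression

rng = np.random.default_rng(0)

def relevance_scores(n):
    x_smooth = rng.uniform(0, 1, n)   # enters y through a slowly varying function
    x_rough = rng.uniform(0, 1, n)    # enters y through a rapidly varying function
    y = (np.sin(2 * np.pi * x_smooth)
         + np.sin(40 * np.pi * x_rough)
         + 0.1 * rng.normal(size=n))
    X = np.column_stack([x_smooth, x_rough])
    # k-NN mutual information, a common filter-method relevance criterion
    return mutual_info_regression(X, y, n_neighbors=3, random_state=0)

for n in (50, 200, 2000):
    mi_smooth, mi_rough = relevance_scores(n)
    print(f"n={n:5d}  MI(smooth)={mi_smooth:.3f}  MI(rough)={mi_rough:.3f}")
# For small n the non-smooth feature typically receives a much lower score,
# even though both terms contribute equally to y; the gap shrinks as n grows.
```

Both features carry the same amount of information about the target, so any systematic gap between their scores at small sample sizes reflects the estimator rather than the data, which is the effect the paper studies.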

Original language: English
Title of host publication: Artificial Intelligence Applications and Innovations - 14th IFIP WG 12.5 International Conference, AIAI 2018, Proceedings
Editors: Ilias Maglogiannis, Lazaros Iliadis, Vassilis Plagianakos
Publisher: Springer New York
Pages: 285-294
Number of pages: 10
ISBN (Print): 9783319920061
DOIs
Publication status: Published - 1 Jan 2018
Event: 14th IFIP WG 12.5 International Conference on Artificial Intelligence Applications and Innovations, AIAI 2018 - Rhodes, Greece
Duration: 25 May 2018 - 27 May 2018

Publication series

Name: IFIP Advances in Information and Communication Technology
Volume: 519
ISSN (Print): 1868-4238

Conference

Conference: 14th IFIP WG 12.5 International Conference on Artificial Intelligence Applications and Innovations, AIAI 2018
Country/Territory: Greece
City: Rhodes
Period: 25/05/18 - 27/05/18

Keywords

  • Feature selection
  • Filter methods
  • Mutual information
  • Noise variance
  • Smoothness
