Valid interpretation of feature relevance for linear data mappings

Benoît Frénay; Daniela Hofmann; Alexander Schulz; Michael Biehl; Barbara Hammer

doi:10.1109/CIDM.2014.7008661

Faculty of Computer Science

Research output: Contribution in Book/Catalog/Report/Conference proceeding › Conference contribution

Abstract

Linear data transformations constitute essential operations in various machine learning algorithms, ranging from linear regression up to adaptive metric transformation. Often, linear scalings are not only used to improve the model accuracy, rather feature coefficients as provided by the mapping are interpreted as an indicator for the relevance of the feature for the task at hand. This principle, however, can be misleading in particular for high-dimensional or correlated features, since it easily marks irrelevant features as relevant or vice versa. In this contribution, we propose a mathematical formalisation of the minimum and maximum feature relevance for a given linear transformation which can efficiently be solved by means of linear programming. We evaluate the method in several benchmarks, where it becomes apparent that the minimum and maximum relevance closely resembles what is often referred to as weak and strong relevance of the features; hence unlike the mere scaling provided by the linear mapping, it ensures valid interpretability.

Original language	English
Title of host publication	IEEE SSCI 2014 - 2014 IEEE Symposium Series on Computational Intelligence - CIDM 2014: 2014 IEEE Symposium on Computational Intelligence and Data Mining, Proceedings
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	149-156
Number of pages	8
ISBN (Print)	9781479945191
DOIs	https://doi.org/10.1109/CIDM.2014.7008661
Publication status	Published - 13 Jan 2015
Event	5th IEEE Symposium on Computational Intelligence and Data Mining, CIDM 2014 - Orlando, United States. Duration: 9 Dec 2014 → 12 Dec 2014

Conference

Conference	5th IEEE Symposium on Computational Intelligence and Data Mining, CIDM 2014
Country/Territory	United States
City	Orlando
Period	9/12/14 → 12/12/14


Cite this

Frénay, B., Hofmann, D., Schulz, A., Biehl, M., & Hammer, B. (2015). Valid interpretation of feature relevance for linear data mappings. In IEEE SSCI 2014 - 2014 IEEE Symposium Series on Computational Intelligence - CIDM 2014: 2014 IEEE Symposium on Computational Intelligence and Data Mining, Proceedings (pp. 149-156). Article 7008661. Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/CIDM.2014.7008661

Frénay, Benoît ; Hofmann, Daniela ; Schulz, Alexander et al. / Valid interpretation of feature relevance for linear data mappings. IEEE SSCI 2014 - 2014 IEEE Symposium Series on Computational Intelligence - CIDM 2014: 2014 IEEE Symposium on Computational Intelligence and Data Mining, Proceedings. Institute of Electrical and Electronics Engineers Inc., 2015. pp. 149-156

@inproceedings{64aa7c43da93409686581e5209d10fe7,
  title = "Valid interpretation of feature relevance for linear data mappings",
  abstract = "Linear data transformations constitute essential operations in various machine learning algorithms, ranging from linear regression up to adaptive metric transformation. Often, linear scalings are not only used to improve the model accuracy, rather feature coefficients as provided by the mapping are interpreted as an indicator for the relevance of the feature for the task at hand. This principle, however, can be misleading in particular for high-dimensional or correlated features, since it easily marks irrelevant features as relevant or vice versa. In this contribution, we propose a mathematical formalisation of the minimum and maximum feature relevance for a given linear transformation which can efficiently be solved by means of linear programming. We evaluate the method in several benchmarks, where it becomes apparent that the minimum and maximum relevance closely resembles what is often referred to as weak and strong relevance of the features; hence unlike the mere scaling provided by the linear mapping, it ensures valid interpretability.",
  author = "Beno{\^i}t Fr{\'e}nay and Daniela Hofmann and Alexander Schulz and Michael Biehl and Barbara Hammer",
  year = "2015",
  month = jan,
  day = "13",
  doi = "10.1109/CIDM.2014.7008661",
  language = "English",
  isbn = "9781479945191",
  pages = "149--156",
  booktitle = "IEEE SSCI 2014 - 2014 IEEE Symposium Series on Computational Intelligence - CIDM 2014: 2014 IEEE Symposium on Computational Intelligence and Data Mining, Proceedings",
  publisher = "Institute of Electrical and Electronics Engineers Inc.",
  address = "United States",
  note = "5th IEEE Symposium on Computational Intelligence and Data Mining, CIDM 2014 ; Conference date: 09-12-2014 Through 12-12-2014",
}

Frénay, B, Hofmann, D, Schulz, A, Biehl, M & Hammer, B 2015, Valid interpretation of feature relevance for linear data mappings. in IEEE SSCI 2014 - 2014 IEEE Symposium Series on Computational Intelligence - CIDM 2014: 2014 IEEE Symposium on Computational Intelligence and Data Mining, Proceedings, 7008661, Institute of Electrical and Electronics Engineers Inc., pp. 149-156, 5th IEEE Symposium on Computational Intelligence and Data Mining, CIDM 2014, Orlando, United States, 9/12/14. https://doi.org/10.1109/CIDM.2014.7008661

Valid interpretation of feature relevance for linear data mappings. / Frénay, Benoît; Hofmann, Daniela; Schulz, Alexander et al.
IEEE SSCI 2014 - 2014 IEEE Symposium Series on Computational Intelligence - CIDM 2014: 2014 IEEE Symposium on Computational Intelligence and Data Mining, Proceedings. Institute of Electrical and Electronics Engineers Inc., 2015. p. 149-156 7008661.

TY - GEN
T1 - Valid interpretation of feature relevance for linear data mappings
AU - Frénay, Benoît
AU - Hofmann, Daniela
AU - Schulz, Alexander
AU - Biehl, Michael
AU - Hammer, Barbara
PY - 2015/1/13
Y1 - 2015/1/13

N2 - Linear data transformations constitute essential operations in various machine learning algorithms, ranging from linear regression up to adaptive metric transformation. Often, linear scalings are not only used to improve the model accuracy, rather feature coefficients as provided by the mapping are interpreted as an indicator for the relevance of the feature for the task at hand. This principle, however, can be misleading in particular for high-dimensional or correlated features, since it easily marks irrelevant features as relevant or vice versa. In this contribution, we propose a mathematical formalisation of the minimum and maximum feature relevance for a given linear transformation which can efficiently be solved by means of linear programming. We evaluate the method in several benchmarks, where it becomes apparent that the minimum and maximum relevance closely resembles what is often referred to as weak and strong relevance of the features; hence unlike the mere scaling provided by the linear mapping, it ensures valid interpretability.

AB - Linear data transformations constitute essential operations in various machine learning algorithms, ranging from linear regression up to adaptive metric transformation. Often, linear scalings are not only used to improve the model accuracy, rather feature coefficients as provided by the mapping are interpreted as an indicator for the relevance of the feature for the task at hand. This principle, however, can be misleading in particular for high-dimensional or correlated features, since it easily marks irrelevant features as relevant or vice versa. In this contribution, we propose a mathematical formalisation of the minimum and maximum feature relevance for a given linear transformation which can efficiently be solved by means of linear programming. We evaluate the method in several benchmarks, where it becomes apparent that the minimum and maximum relevance closely resembles what is often referred to as weak and strong relevance of the features; hence unlike the mere scaling provided by the linear mapping, it ensures valid interpretability.

UR - http://www.scopus.com/inward/record.url?scp=84925035581&partnerID=8YFLogxK
U2 - 10.1109/CIDM.2014.7008661
DO - 10.1109/CIDM.2014.7008661
M3 - Conference contribution
SN - 9781479945191
SP - 149
EP - 156
BT - IEEE SSCI 2014 - 2014 IEEE Symposium Series on Computational Intelligence - CIDM 2014: 2014 IEEE Symposium on Computational Intelligence and Data Mining, Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 5th IEEE Symposium on Computational Intelligence and Data Mining, CIDM 2014
Y2 - 9 December 2014 through 12 December 2014
ER -

Frénay B, Hofmann D, Schulz A, Biehl M, Hammer B. Valid interpretation of feature relevance for linear data mappings. In IEEE SSCI 2014 - 2014 IEEE Symposium Series on Computational Intelligence - CIDM 2014: 2014 IEEE Symposium on Computational Intelligence and Data Mining, Proceedings. Institute of Electrical and Electronics Engineers Inc. 2015. p. 149-156. 7008661 doi: 10.1109/CIDM.2014.7008661
