Valid interpretation of feature relevance for linear data mappings

Benoît Frénay, Daniela Hofmann, Alexander Schulz, Michael Biehl, Barbara Hammer

Research output: Contribution in Book/Catalog/Report/Conference proceedingConference contribution

Abstract

Linear data transformations constitute essential operations in various machine learning algorithms, ranging from linear regression up to adaptive metric transformation. Often, linear scalings are not only used to improve the model accuracy, rather feature coefficients as provided by the mapping are interpreted as an indicator for the relevance of the feature for the task at hand. This principle, however, can be misleading in particular for high-dimensional or correlated features, since it easily marks irrelevant features as relevant or vice versa. In this contribution, we propose a mathematical formalisation of the minimum and maximum feature relevance for a given linear transformation which can efficiently be solved by means of linear programming. We evaluate the method in several benchmarks, where it becomes apparent that the minimum and maximum relevance closely resembles what is often referred to as weak and strong relevance of the features; hence unlike the mere scaling provided by the linear mapping, it ensures valid interpretability.

Original languageEnglish
Title of host publicationIEEE SSCI 2014 - 2014 IEEE Symposium Series on Computational Intelligence - CIDM 2014: 2014 IEEE Symposium on Computational Intelligence and Data Mining, Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages149-156
Number of pages8
ISBN (Print)9781479945191
DOIs
Publication statusPublished - 13 Jan 2015
Event5th IEEE Symposium on Computational Intelligence and Data Mining, CIDM 2014 - Orlando, United States
Duration: 9 Dec 201412 Dec 2014

Conference

Conference5th IEEE Symposium on Computational Intelligence and Data Mining, CIDM 2014
Country/TerritoryUnited States
CityOrlando
Period9/12/1412/12/14

Fingerprint

Dive into the research topics of 'Valid interpretation of feature relevance for linear data mappings'. Together they form a unique fingerprint.

Cite this