Globally local and fast explanations of t-SNE-like nonlinear embeddings

Pierre Lambert; Rebecca Marion; Julien Albert; Emmanuel Jean; Sacha Corbugy; Cyril de Bodt

Research output: Contribution in Book/Catalog/Report/Conference proceeding › Chapter (peer-reviewed) › peer-review

Abstract

Nonlinear dimensionality reduction (NLDR) algorithms such as t-SNE are often employed to visually analyze high-dimensional (HD) data sets in the form of low-dimensional (LD) embeddings. Unfortunately, the nonlinearity of the NLDR process prohibits the interpretation of the resulting embeddings in terms of the HD features. State-of-the-art studies propose post-hoc explanation approaches to locally explain the embeddings. However, such tools are typically slow and do not automatically cover the entire LD embedding, instead providing local explanations around one selected data point at a time. This prevents users from quickly gaining insights about the general explainability landscape of the embedding. This paper presents a globally local and fast explanation framework for NLDR embeddings. This framework is fast because it only requires the computation of sparse linear regression models on subsets of the data, without ever reapplying the NLDR algorithm itself. In addition, the framework is globally local in the sense that the entire LD embedding is automatically covered by multiple local explanations. The different interpretable structures in the embedding are directly characterized, making it possible to quantify the importance of the HD features in various regions of the LD embedding. An example use-case is examined, emphasizing the value of the presented framework. Public code and software are available at https://github.com/PierreLambert3/glocally_explained.
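
The idea of fitting sparse linear regression models on subsets of the data can be illustrated with a minimal sketch. This is not the authors' implementation (see the repository linked above for that). It assumes that regions of the LD embedding are obtained with k-means on the LD coordinates, that sparsity comes from a Lasso penalty with an arbitrary strength `alpha`, and that a synthetic stand-in embedding `Y` replaces a real t-SNE output. The function name `local_sparse_explanations` is hypothetical.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.linear_model import Lasso
from sklearn.preprocessing import StandardScaler


def local_sparse_explanations(X, Y, n_regions=2, alpha=0.05, random_state=0):
    """Fit one sparse linear model per region of the LD embedding.

    X: HD data, shape (n_samples, n_features).
    Y: LD embedding, shape (n_samples, 2), computed beforehand.
    Returns region labels and a (n_regions, n_features) importance matrix.
    """
    # Assumption: regions are defined by k-means on the LD coordinates.
    kmeans = KMeans(n_clusters=n_regions, n_init=10, random_state=random_state)
    labels = kmeans.fit_predict(Y)

    importances = np.zeros((n_regions, X.shape[1]))
    for region in range(n_regions):
        mask = labels == region
        # Standardize HD features so coefficient magnitudes are comparable.
        X_region = StandardScaler().fit_transform(X[mask])
        # Center the LD coordinates of the region.
        Y_region = Y[mask] - Y[mask].mean(axis=0)
        # Lasso regresses each LD axis on the HD features;
        # coef_ has shape (n_LD_axes, n_features).
        model = Lasso(alpha=alpha).fit(X_region, Y_region)
        # Importance of a feature: summed absolute coefficients over LD axes.
        importances[region] = np.abs(model.coef_).sum(axis=0)
    return labels, importances


# Synthetic stand-in for an embedding (assumption: no NLDR algorithm is run).
rng = np.random.default_rng(0)
n, d = 200, 6
X = rng.normal(size=(2 * n, d))
Y = np.empty((2 * n, 2))
# First group sits on the left and spreads along LD axis 0 with HD feature 0.
Y[:n, 0] = -10.0 + X[:n, 0]
Y[:n, 1] = 0.1 * rng.normal(size=n)
# Second group sits on the right and spreads along LD axis 1 with HD feature 3.
Y[n:, 0] = 10.0 + 0.1 * rng.normal(size=n)
Y[n:, 1] = X[n:, 3]

labels, importances = local_sparse_explanations(X, Y)
for region, row in enumerate(importances):
    print(f"region {region}: most important HD feature = {row.argmax()}")
```

The sketch only solves small regression problems on the existing embedding, which mirrors the abstract's point that the NLDR algorithm is never reapplied. How the actual framework selects its subsets and regularization is described in the paper, not here.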

Original language	English
Title of host publication	CIKM-WS 2022
Subtitle of host publication	Proceedings of the CIKM 2022 Workshops
Editors	Georgios Drakopoulos, Eleanna Kafeza
Publisher	CEUR Workshop Proceedings
Publication status	Published - 2022
Event	2022 International Conference on Information and Knowledge Management Workshops, CIKM-WS 2022 - Atlanta, United States
Duration	17 Oct 2022 → 21 Oct 2022

Publication series

Name	CEUR Workshop Proceedings
Publisher	CEUR-WS
Volume	3318
ISSN (Print)	1613-0073

Conference

Conference	2022 International Conference on Information and Knowledge Management Workshops, CIKM-WS 2022
Country/Territory	United States
City	Atlanta
Period	17/10/22 → 21/10/22

Keywords

data exploration
data visualization
dimensionality reduction
explainability
interactivity
interpretability
t-SNE

Cite this

@inbook{16274cab1c594a4180e01bf2e35acae8,
  title = "Globally local and fast explanations of t-SNE-like nonlinear embeddings",
  abstract = "Nonlinear dimensionality reduction (NLDR) algorithms such as t-SNE are often employed to visually analyze high-dimensional (HD) data sets in the form of low-dimensional (LD) embeddings. Unfortunately, the nonlinearity of the NLDR process prohibits the interpretation of the resulting embeddings in terms of the HD features. State-of-the-art studies propose post-hoc explanation approaches to locally explain the embeddings. However, such tools are typically slow and do not automatically cover the entire LD embedding, instead providing local explanations around one selected data point at a time. This prevents users from quickly gaining insights about the general explainability landscape of the embedding. This paper presents a globally local and fast explanation framework for NLDR embeddings. This framework is fast because it only requires the computation of sparse linear regression models on subsets of the data, without ever reapplying the NLDR algorithm itself. In addition, the framework is globally local in the sense that the entire LD embedding is automatically covered by multiple local explanations. The different interpretable structures in the embedding are directly characterized, making it possible to quantify the importance of the HD features in various regions of the LD embedding. An example use-case is examined, emphasizing the value of the presented framework. Public code and software are available at https://github.com/PierreLambert3/glocally_explained.",
  keywords = "data exploration, data visualization, dimensionality reduction, explainability, interactivity, interpretability, t-SNE",
  author = "Pierre Lambert and Rebecca Marion and Julien Albert and Emmanuel Jean and Sacha Corbugy and {de Bodt}, Cyril",
  note = "Funding Information: This work was supported by Service Public de Wallonie Recherche under grant n° 2010235-ARIAC by DIGITAL-WALLONIA4.AI. SC is supported by a FRIA grant (F.R.S.-FNRS). Publisher Copyright: {\textcopyright} 2022 Copyright for this paper by its authors.; 2022 International Conference on Information and Knowledge Management Workshops, CIKM-WS 2022 ; Conference date: 17-10-2022 Through 21-10-2022",
  year = "2022",
  language = "English",
  series = "CEUR Workshop Proceedings",
  publisher = "CEUR Workshop Proceedings",
  editor = "Georgios Drakopoulos and Eleanna Kafeza",
  booktitle = "CIKM-WS 2022",
}

Lambert, P, Marion, R, Albert, J, Jean, E, Corbugy, S & de Bodt, C 2022, Globally local and fast explanations of t-SNE-like nonlinear embeddings. in G Drakopoulos & E Kafeza (eds), CIKM-WS 2022: Proceedings of the CIKM 2022 Workshops. CEUR Workshop Proceedings, vol. 3318, CEUR Workshop Proceedings, 2022 International Conference on Information and Knowledge Management Workshops, CIKM-WS 2022, Atlanta, United States, 17/10/22.

Globally local and fast explanations of t-SNE-like nonlinear embeddings. / Lambert, Pierre; Marion, Rebecca; Albert, Julien et al.
CIKM-WS 2022: Proceedings of the CIKM 2022 Workshops. ed. / Georgios Drakopoulos; Eleanna Kafeza. CEUR Workshop Proceedings, 2022. (CEUR Workshop Proceedings; Vol. 3318).

TY - CHAP
T1 - Globally local and fast explanations of t-SNE-like nonlinear embeddings
AU - Lambert, Pierre
AU - Marion, Rebecca
AU - Albert, Julien
AU - Jean, Emmanuel
AU - Corbugy, Sacha
AU - de Bodt, Cyril
N1 - Funding Information: This work was supported by Service Public de Wallonie Recherche under grant n° 2010235-ARIAC by DIGITAL-WALLONIA4.AI. SC is supported by a FRIA grant (F.R.S.-FNRS). Publisher Copyright: © 2022 Copyright for this paper by its authors.
PY - 2022
Y1 - 2022
N2 - Nonlinear dimensionality reduction (NLDR) algorithms such as t-SNE are often employed to visually analyze high-dimensional (HD) data sets in the form of low-dimensional (LD) embeddings. Unfortunately, the nonlinearity of the NLDR process prohibits the interpretation of the resulting embeddings in terms of the HD features. State-of-the-art studies propose post-hoc explanation approaches to locally explain the embeddings. However, such tools are typically slow and do not automatically cover the entire LD embedding, instead providing local explanations around one selected data point at a time. This prevents users from quickly gaining insights about the general explainability landscape of the embedding. This paper presents a globally local and fast explanation framework for NLDR embeddings. This framework is fast because it only requires the computation of sparse linear regression models on subsets of the data, without ever reapplying the NLDR algorithm itself. In addition, the framework is globally local in the sense that the entire LD embedding is automatically covered by multiple local explanations. The different interpretable structures in the embedding are directly characterized, making it possible to quantify the importance of the HD features in various regions of the LD embedding. An example use-case is examined, emphasizing the value of the presented framework. Public code and software are available at https://github.com/PierreLambert3/glocally_explained.
AB - Nonlinear dimensionality reduction (NLDR) algorithms such as t-SNE are often employed to visually analyze high-dimensional (HD) data sets in the form of low-dimensional (LD) embeddings. Unfortunately, the nonlinearity of the NLDR process prohibits the interpretation of the resulting embeddings in terms of the HD features. State-of-the-art studies propose post-hoc explanation approaches to locally explain the embeddings. However, such tools are typically slow and do not automatically cover the entire LD embedding, instead providing local explanations around one selected data point at a time. This prevents users from quickly gaining insights about the general explainability landscape of the embedding. This paper presents a globally local and fast explanation framework for NLDR embeddings. This framework is fast because it only requires the computation of sparse linear regression models on subsets of the data, without ever reapplying the NLDR algorithm itself. In addition, the framework is globally local in the sense that the entire LD embedding is automatically covered by multiple local explanations. The different interpretable structures in the embedding are directly characterized, making it possible to quantify the importance of the HD features in various regions of the LD embedding. An example use-case is examined, emphasizing the value of the presented framework. Public code and software are available at https://github.com/PierreLambert3/glocally_explained.
KW - data exploration
KW - data visualization
KW - dimensionality reduction
KW - explainability
KW - interactivity
KW - interpretability
KW - t-SNE
UR - http://www.scopus.com/inward/record.url?scp=85146253149&partnerID=8YFLogxK
M3 - Chapter (peer-reviewed)
AN - SCOPUS:85146253149
T3 - CEUR Workshop Proceedings
BT - CIKM-WS 2022
A2 - Drakopoulos, Georgios
A2 - Kafeza, Eleanna
PB - CEUR Workshop Proceedings
T2 - 2022 International Conference on Information and Knowledge Management Workshops, CIKM-WS 2022
Y2 - 17 October 2022 through 21 October 2022
ER -

Projects

ARIAC by DigitalWallonia4.AI: Applications and Research for Trusted Artificial Intelligence (TRAIL-Foundations)