TY - JOUR
T1 - Constraint Preserving Score for Automatic Hyperparameter Tuning of Dimensionality Reduction Methods for Visualization
AU - Vu, Viet Minh
AU - Bibal, Adrien
AU - Frénay, Benoît
PY - 2021/7/7
Y1 - 2021/7/7
N2 - In data analysis, visualization through dimensionality reduction (DR) is one of the most effective ways to understand a dataset. However, the quality of a visualization is hard to evaluate quantitatively and the hyperparameters of visualization algorithms are sometimes difficult to tune for end-users. This article proposes a score for visualization assessment that can be used to ease the choice of hyperparameter values for widely used DR methods like $t$ -distributed stochastic neighbor embedding, LargeVis, and uniform manifold approximation and projection. We present the constraint preserving score , a computationally efficient score to measure visualization quality. The idea is to measure how well a visualization preserves the information encoded in pairwise constraints like group information or similarity/dissimilarity relationships between instances. Based on this quantitative measure, we use Bayesian optimization to effectively explore the solution space of all visualizations and find the most suitable one. The proposed score is flexible as it can measure quality in different ways depending on the provided constraints. Experiments show its interest for end-users, its complementarity with existing visualization quality measures, and its flexibility to easily express different quality aspects.
AB - In data analysis, visualization through dimensionality reduction (DR) is one of the most effective ways to understand a dataset. However, the quality of a visualization is hard to evaluate quantitatively and the hyperparameters of visualization algorithms are sometimes difficult to tune for end-users. This article proposes a score for visualization assessment that can be used to ease the choice of hyperparameter values for widely used DR methods like $t$ -distributed stochastic neighbor embedding, LargeVis, and uniform manifold approximation and projection. We present the constraint preserving score , a computationally efficient score to measure visualization quality. The idea is to measure how well a visualization preserves the information encoded in pairwise constraints like group information or similarity/dissimilarity relationships between instances. Based on this quantitative measure, we use Bayesian optimization to effectively explore the solution space of all visualizations and find the most suitable one. The proposed score is flexible as it can measure quality in different ways depending on the provided constraints. Experiments show its interest for end-users, its complementarity with existing visualization quality measures, and its flexibility to easily express different quality aspects.
KW - Bayesian Optimization
KW - Dimensionality Reduction
KW - Pairwise Constraints
KW - Visualization
KW - Hyperparameter Tuning
KW - Machine Learning
UR - https://ieeexplore.ieee.org/document/9476903
M3 - Article
SN - 2691-4581
VL - 2
SP - 269
EP - 282
JO - IEEE Transactions on Artificial Intelligence
JF - IEEE Transactions on Artificial Intelligence
IS - 3
ER -