TY - GEN
T1 - HCt-SNE
T2 - Hierarchical Constraints with t-SNE
AU - Vu, Viet Minh
AU - Bibal, Adrien
AU - Frénay, Benoît
PY - 2021
Y1 - 2021
N2 - Dimensionality reduction (DR) methods are useful when analyzing high dimensional data, in particular, if one wants to visualize them. t-distributed stochastic neighbor embedding (t-SNE), one of the most widely used DR methods, can preserve neighborhood information and reveal groups in embeddings. However, it may not preserve the global structure and fail to reveal the semantic information in the visualization. From a user point-of-view, a DR visualization is useful if it not only reveals hidden structures in the data but also corresponds to the user knowledge. This paper addresses these problems by proposing Hierarchical Constraint t-SNE (HCt-SNE), a method that allows users to integrate hierarchical constraints directly into t-SNE embeddings. The user constraints are encoded in an explicit tree. We transform the hierarchical information in this tree into a novel regularization term based on triplet constraints among the nodes at different levels in the tree. Our method takes advantage of semantic information provided in class labels and outperforms the original t-SNE and two other supervised DR methods in terms of both visual assessment and quality metrics on three classic image datasets: MNIST, Fashion-MNIST and CIFAR10.
AB - Dimensionality reduction (DR) methods are useful when analyzing high dimensional data, in particular, if one wants to visualize them. t-distributed stochastic neighbor embedding (t-SNE), one of the most widely used DR methods, can preserve neighborhood information and reveal groups in embeddings. However, it may not preserve the global structure and fail to reveal the semantic information in the visualization. From a user point-of-view, a DR visualization is useful if it not only reveals hidden structures in the data but also corresponds to the user knowledge. This paper addresses these problems by proposing Hierarchical Constraint t-SNE (HCt-SNE), a method that allows users to integrate hierarchical constraints directly into t-SNE embeddings. The user constraints are encoded in an explicit tree. We transform the hierarchical information in this tree into a novel regularization term based on triplet constraints among the nodes at different levels in the tree. Our method takes advantage of semantic information provided in class labels and outperforms the original t-SNE and two other supervised DR methods in terms of both visual assessment and quality metrics on three classic image datasets: MNIST, Fashion-MNIST and CIFAR10.
KW - dimensionality reduction
KW - visualization
KW - t-SNE
KW - hierarchical constraint
KW - Triplet loss
M3 - Conference contribution
BT - International Joint Conference on Neural Networks
ER -