Spectral Tools for Neural Networks

Lorenzo Giambagli

Spectral Tools for Neural Networks

Technological Platform High Performance Computing

Research output: Contribution to conference › Poster › peer-review

Abstract

Deep Feedforward Neural Networks (FFNNs) play a central role in the Machine Learning field. They are usually trained in the space of nodes, by adjusting the weights of existing links via suitable optimization protocols. Recently a radically new approach has been proposed (1). By anchoring the learning process to reciprocal space, the new targets of the optimization process are eigenvectors and eigenvalues of the transfer operators between layers.

Shifting the focus on such fundamental mathematical structures we have been able to understand their pivotal role in training and analyzing NNs. Indeed, while seeking for a small subset of trainable parameters capable of carrying out the training procedure, eigenvalues are what to look for. Choosing them as trainable parameters allows the optimizer to exploit the parallel adjustment of several weights, the ones underlined by the corresponding eigenvector, and therefore made their after-training interpretation possible.

Firstly, eigenvalues magnitude after the training procedure has occurred has been empirically and heuristically proven being a proxy their relevance in the optimization process. Indeed, a precise correspondence between nodes and eigenvalues can be established, leading to a novel pruning procedure. The nodes related with low magnitude eigenvalues can be removed leading to a fast and easy implemented network compression algorithm.

Secondly, accounting for eigenvalues in the optimization process, it is possible to dynamically train sparse network. Sparsity constrains in the direct space implies that certain weights got filtered under a mask, leading to a gradient equal to zero during the training procedure. Working in the reciprocal space, however, allows masked weights to still be modified, due to the non-local effect of the eigenvalues. Such approach leads to sparse networks whose topology is not fixed to the starting one, resulting in a much more efficient training.

Original language	English
Publication status	Published - 20 Jun 2022
Event	Conference of the Italian Society of Statistical Physics - Nuovo Polo Didattico, Via Kennedy, Parma, Italy Duration: 20 Jun 2022 → 22 Jun 2022 Conference number: 2 https://www.fisicastatistica.org/convegno-sifs

Conference

Conference	Conference of the Italian Society of Statistical Physics
Abbreviated title	SIFS
Country/Territory	Italy
City	Parma
Period	20/06/22 → 22/06/22
Internet address	https://www.fisicastatistica.org/convegno-sifs

1 Article

Spectral pruning of fully connected layers
Giambagli, L., Buffoni, L., Chicchi, L., Civitelli, E. & Fanelli, D., 1 Jul 2022, In: Scientific Reports. 12, 1, 11201.
Research output: Contribution to journal › Article › peer-review

Open Access
File
22 Downloads (Pure)

Best poster prize
Van der Henst, Charles (Recipient), 2010
Prize: Prize (including medals and awards)

Cite this

@conference{0a6b5163d1bb4e73afde8433b290091a,

title = "Spectral Tools for Neural Networks",

abstract = "Deep Feedforward Neural Networks (FFNNs) play a central role in the Machine Learning field. They are usually trained in the space of nodes, by adjusting the weights of existing links via suitable optimization protocols. Recently a radically new approach has been proposed (1). By anchoring the learning process to reciprocal space, the new targets of the optimization process are eigenvectors and eigenvalues of the transfer operators between layers.Shifting the focus on such fundamental mathematical structures we have been able to understand their pivotal role in training and analyzing NNs. Indeed, while seeking for a small subset of trainable parameters capable of carrying out the training procedure, eigenvalues are what to look for. Choosing them as trainable parameters allows the optimizer to exploit the parallel adjustment of several weights, the ones underlined by the corresponding eigenvector, and therefore made their after-training interpretation possible.Firstly, eigenvalues magnitude after the training procedure has occurred has been empirically and heuristically proven being a proxy their relevance in the optimization process. Indeed, a precise correspondence between nodes and eigenvalues can be established, leading to a novel pruning procedure. The nodes related with low magnitude eigenvalues can be removed leading to a fast and easy implemented network compression algorithm.Secondly, accounting for eigenvalues in the optimization process, it is possible to dynamically train sparse network. Sparsity constrains in the direct space implies that certain weights got filtered under a mask, leading to a gradient equal to zero during the training procedure. Working in the reciprocal space, however, allows masked weights to still be modified, due to the non-local effect of the eigenvalues. Such approach leads to sparse networks whose topology is not fixed to the starting one, resulting in a much more efficient training.",

author = "Lorenzo Giambagli",

year = "2022",

month = jun,

day = "20",

language = "English",

note = "Conference of the Italian Society of Statistical Physics, SIFS ; Conference date: 20-06-2022 Through 22-06-2022",

url = "https://www.fisicastatistica.org/convegno-sifs",

}

TY - CONF

T1 - Spectral Tools for Neural Networks

AU - Giambagli, Lorenzo

N1 - Conference code: 2

PY - 2022/6/20

Y1 - 2022/6/20

N2 - Deep Feedforward Neural Networks (FFNNs) play a central role in the Machine Learning field. They are usually trained in the space of nodes, by adjusting the weights of existing links via suitable optimization protocols. Recently a radically new approach has been proposed (1). By anchoring the learning process to reciprocal space, the new targets of the optimization process are eigenvectors and eigenvalues of the transfer operators between layers.Shifting the focus on such fundamental mathematical structures we have been able to understand their pivotal role in training and analyzing NNs. Indeed, while seeking for a small subset of trainable parameters capable of carrying out the training procedure, eigenvalues are what to look for. Choosing them as trainable parameters allows the optimizer to exploit the parallel adjustment of several weights, the ones underlined by the corresponding eigenvector, and therefore made their after-training interpretation possible.Firstly, eigenvalues magnitude after the training procedure has occurred has been empirically and heuristically proven being a proxy their relevance in the optimization process. Indeed, a precise correspondence between nodes and eigenvalues can be established, leading to a novel pruning procedure. The nodes related with low magnitude eigenvalues can be removed leading to a fast and easy implemented network compression algorithm.Secondly, accounting for eigenvalues in the optimization process, it is possible to dynamically train sparse network. Sparsity constrains in the direct space implies that certain weights got filtered under a mask, leading to a gradient equal to zero during the training procedure. Working in the reciprocal space, however, allows masked weights to still be modified, due to the non-local effect of the eigenvalues. Such approach leads to sparse networks whose topology is not fixed to the starting one, resulting in a much more efficient training.

AB - Deep Feedforward Neural Networks (FFNNs) play a central role in the Machine Learning field. They are usually trained in the space of nodes, by adjusting the weights of existing links via suitable optimization protocols. Recently a radically new approach has been proposed (1). By anchoring the learning process to reciprocal space, the new targets of the optimization process are eigenvectors and eigenvalues of the transfer operators between layers.Shifting the focus on such fundamental mathematical structures we have been able to understand their pivotal role in training and analyzing NNs. Indeed, while seeking for a small subset of trainable parameters capable of carrying out the training procedure, eigenvalues are what to look for. Choosing them as trainable parameters allows the optimizer to exploit the parallel adjustment of several weights, the ones underlined by the corresponding eigenvector, and therefore made their after-training interpretation possible.Firstly, eigenvalues magnitude after the training procedure has occurred has been empirically and heuristically proven being a proxy their relevance in the optimization process. Indeed, a precise correspondence between nodes and eigenvalues can be established, leading to a novel pruning procedure. The nodes related with low magnitude eigenvalues can be removed leading to a fast and easy implemented network compression algorithm.Secondly, accounting for eigenvalues in the optimization process, it is possible to dynamically train sparse network. Sparsity constrains in the direct space implies that certain weights got filtered under a mask, leading to a gradient equal to zero during the training procedure. Working in the reciprocal space, however, allows masked weights to still be modified, due to the non-local effect of the eigenvalues. Such approach leads to sparse networks whose topology is not fixed to the starting one, resulting in a much more efficient training.

M3 - Poster

T2 - Conference of the Italian Society of Statistical Physics

Y2 - 20 June 2022 through 22 June 2022

ER -

Spectral Tools for Neural Networks

Abstract

Conference

Research output

Spectral pruning of fully connected layers

Prizes

Best poster prize

Cite this