82524

Автор(ы): 

Автор(ов): 

3

Параметры публикации

Тип публикации: 

Доклад

Название: 

ConvNets Landscape Convergence: Hessian-Based Analysis of Matricized Networks

Электронная публикация: 

Да

ISBN/ISSN: 

2767-9535

DOI: 

10.1109/ispras64596.2024.10899113

Наименование конференции: 

  • 2024 Ivannikov Ispras Open Conference (ISPRAS)

Наименование источника: 

  • Proceedings of the Ivannikov Memorial Workshop (IVMEM), 2024

Город: 

  • Москва

Издательство: 

  • IEEE

Год издания: 

2024

Страницы: 

https://ieeexplore.ieee.org/document/10899113
Аннотация
The Hessian of a neural network is an important aspect for understanding the loss landscape and the characteristic of network architecture. The Hessian matrix captures important information about the curvature, sensitivity, and local behavior of the loss function. Our work proposes a method that enhances the understanding of the local behavior of the loss function and can be used to analyze the behavior of neural networks and also for interpreting the parameters in these networks. In this paper, we consider an approach to investigate the properties of the deep neural network, using the Hessian. We propose a method for estimating the Hessian matrix norm for a specific type of neural networks like convolutional. We have obtained the results for both 1D and 2D convolutions, as well as for the fully connected head in these networks. Our empirical analysis supports these findings, demonstrating convergence in the loss function landscape. We have evaluated the Hessian norm for neural networks represented as a product of matrices and considered how this estimate affects the landscape of the loss function.

Библиографическая ссылка: 

Мешков В.С., Киселев Н.С., Грабовой А.В. ConvNets Landscape Convergence: Hessian-Based Analysis of Matricized Networks / Proceedings of the Ivannikov Memorial Workshop (IVMEM), 2024. М.: IEEE, 2024. С. https://ieeexplore.ieee.org/document/10899113.