The paper studies margin-based loss functions from deep metric learning applied to the problem of recognizing a person's emotional state from voice data. The focus is on loss functions widely used in face recognition, such as ArcFace, CosFace, SphereFace, and AM-Softmax, which are rarely applied to speech emotion analysis. The experiments used a model combining LSTM and CNN layers, trained on the RAVDESS dataset. SphereFace achieved the highest Top-1 and Top-5 accuracy on both the training and test sets, outperforming the other loss functions. The results show that applying these loss functions improves the classification of emotional states, opening up prospects for their use in real-world applications such as health care and customer-service automation.
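Since the margin-based loss functions are central to the study, the sketch below illustrates an additive angular margin (ArcFace-style) classification head in PyTorch. This is a minimal illustration, not the paper's implementation: the class name, embedding size, and the scale `s` and margin `m` defaults are assumptions chosen for clarity.

```python
# Minimal sketch of an ArcFace-style additive angular margin head.
# Hyperparameters (s, m) and names are illustrative, not the paper's configuration.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ArcMarginHead(nn.Module):
    """Produces s * cos(theta + m) logits for the target class, where theta is
    the angle between the L2-normalized embedding and class weight vector."""

    def __init__(self, embed_dim: int, num_classes: int,
                 s: float = 30.0, m: float = 0.5):
        super().__init__()
        self.weight = nn.Parameter(torch.empty(num_classes, embed_dim))
        nn.init.xavier_uniform_(self.weight)
        self.s, self.m = s, m

    def forward(self, embeddings: torch.Tensor, labels: torch.Tensor) -> torch.Tensor:
        # Cosine similarity between normalized embeddings and class weights.
        cos = F.linear(F.normalize(embeddings), F.normalize(self.weight))
        theta = torch.acos(cos.clamp(-1.0 + 1e-7, 1.0 - 1e-7))
        # Add the angular margin m only to the target-class angle.
        target_mask = F.one_hot(labels, cos.size(1)).bool()
        logits = torch.where(target_mask, torch.cos(theta + self.m), cos)
        return self.s * logits  # pass to nn.CrossEntropyLoss

# Usage: logits = head(embeddings, labels); loss = F.cross_entropy(logits, labels)
```

The additive angular margin penalizes the target-class angle directly, which is what gives these losses tighter intra-class clustering than plain softmax; CosFace/AM-Softmax and SphereFace differ mainly in whether the margin is applied additively to the cosine or multiplicatively to the angle.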