60068

Автор(ы): 

Автор(ов): 

3

Параметры публикации

Тип публикации: 

Статья в журнале/сборнике

Название: 

An Overview of Phonetic Encoding Algorithms

ISBN/ISSN: 

0005-1179

DOI: 

10.1134/S0005117920100082

Наименование источника: 

  • Automation and Remote Control

Обозначение и номер тома: 

Vol. 81, No 10

Город: 

  • Moscow

Издательство: 

  • Pleiades Publishing, Ltd.

Год издания: 

2020

Страницы: 

1896-1910
Аннотация
This paper presents an overview of the phonetic encoding algorithms designed to determine the similarity of words in sound (pronunciation). Phonetic encoding algorithms are divided into the algorithms for comparing words and the algorithms for determining the distance between words. Word comparison algorithms, such as SoundEx, NYSIIS, Daitch–Mokotoff, Metaphone, and Polyphone, as well as algorithms for determining the distance between words, such as Levenshtein, Jaro, and N-grams, are described. For each algorithm, the advantages and shortcomings are discussed, and an analog for the Russian language is given. For eliminating the common shortcomings of phonetic encoding algorithms, the idea suggested in this paper is to use not the letter sequences of words, but the sequences of their elementary sounds. In this case, word recognition, record linkage, and word indexing by sounds are expected to improve.

Библиографическая ссылка: 

Выхованец В.С., Ду Ц.Н., Сакулин С.А. An Overview of Phonetic Encoding Algorithms // Automation and Remote Control. 2020. Vol. 81, No 10. С. 1896-1910.

Публикация имеет версию на другом языке или вышла в другом издании, например, в электронной (или онлайн) версии журнала: 

Да

Связь с публикацией: