82325

Автор(ы): 

Автор(ов): 

4

Параметры публикации

Тип публикации: 

Статья в журнале/сборнике

Название: 

Search for Near-Duplicate Handwritten Documents for Data-Intensive Applications

ISBN/ISSN: 

1064-2307

DOI: 

10.1134/S1064230724700503

Наименование источника: 

  • Journal of Computer and Systems Sciences International

Обозначение и номер тома: 

V.63 № 4

Город: 

  • Нью-Йорк

Издательство: 

  • PLEIADES PUBLISHING,Ltd

Год издания: 

2024

Страницы: 

687-694
Аннотация
The problem of cheating in handwritten academic essays has become more significant over the past few years. One type of cheating involves submitting the same paper, photographed in a different environment (for example, from another angle, in a different light, or in lower quality) or changed by automatic augmentation. The existing methods for detecting near-duplicates are not designed to work on large collections of handwritten documents, which significantly limits their use in practice. A machine learning-based method is presented that enables the detection of near-duplicate handwritten text images among large collections of potential sources. The proposed approach consists of three stages: converting the image into a vector representation, searching for candidates, and then selecting the source of duplication among the candidates. Our method achieved 80% and 59% recall-at-1 with false positive rate of 4.8% and 5.5% on Synthetic and Real data, respectively. The search latency is 5.5 seconds per query for a collection of 10 000 images. The results showed that the developed method is sufficiently robust to solve problems that require checking large collections of handwritten documents for cheating.

Библиографическая ссылка: 

Варламова К.Д., Каприелова М.С., Потяшин И.О., Чехович Ю.В. Search for Near-Duplicate Handwritten Documents for Data-Intensive Applications // Journal of Computer and Systems Sciences International. 2024. V.63 № 4. С. 687-694.

Публикация имеет версию на другом языке или вышла в другом издании, например, в электронной (или онлайн) версии журнала: 

Да

Связь с публикацией: