The method of search for falsifications in copies of contractual documents based on N-grams

Bibliographic details
Title: The method of search for falsifications in copies of contractual documents based on N-grams
Authors: Elena Andreeva, Vladimir V. Arlazarov, Oleg Slavin
Source: Thirteenth International Conference on Machine Vision.
Publication info: SPIE, 2021.
Publication year: 2021
Subject terms: Matching (statistics), Information retrieval, Computer science, media_common.quotation_subject, Quality (business), Business documents, Banking sector, Reliability (statistics), Word (computer architecture), media_common, Task (project management)
Description: This article addresses methods for detecting falsifications in scanned copies of business documents. The task arises when comparing two copies of a business document signed by two parties: the comparison is performed to detect possible changes made by one of the parties. The problem is relevant, for instance, in the banking sector when agreements are signed on paper. The article considers partial matching of flexible documents, in which text attributes may change and non-essential words may be modified unintentionally. It proposes a method for comparing two scanned images based on the recognition and analysis of N-gram word sequences. The proposed method was tested on a private dataset and demonstrated high quality and reliability in finding differences between two samples of one agreement-type document.
Access URL: https://explore.openaire.eu/search/publication?articleId=doi_________::7c3aeadc9718f4cfd00ec5581cb9dfde
https://doi.org/10.1117/12.2587026
Accession number: edsair.doi...........7c3aeadc9718f4cfd00ec5581cb9dfde
Database: OpenAIRE
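
The description above mentions comparing two copies through N-gram word sequences. The record does not give the authors' algorithm, so the following is only a minimal sketch of the general idea, not the published method. It assumes both copies have already been recognized into plain text and that words match exactly (no tolerance for recognition errors or formatting changes). The names `word_ngrams` and `unmatched_ngrams` and the sample sentences are hypothetical.

```python
from collections import Counter


def word_ngrams(text, n=3):
    """Return the word n-grams of a text as a list of tuples (lowercased)."""
    words = text.lower().split()
    return [tuple(words[i:i + n]) for i in range(len(words) - n + 1)]


def unmatched_ngrams(reference, candidate, n=3):
    """Return candidate n-grams with no counterpart in the reference.

    Uses a multiset difference, so a repeated n-gram must be matched
    as many times as it occurs. Assumption: exact word matching only.
    """
    ref = Counter(word_ngrams(reference, n))
    cand = Counter(word_ngrams(candidate, n))
    return sorted((cand - ref).elements())


# Hypothetical example: one digit group altered in the second copy.
original = "the borrower shall repay the loan within 12 months"
altered = "the borrower shall repay the loan within 21 months"
diff = unmatched_ngrams(original, altered)
for gram in diff:
    print(" ".join(gram))
```

Every reported n-gram overlaps the altered word, which localizes the suspected change. A practical system would additionally need approximate word matching to absorb recognition errors; that part is outside this sketch.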