Application of winnowing algorithm in development of lecturer research performance information system.

Bibliographic details
Title: Application of winnowing algorithm in development of lecturer research performance information system.
Authors: Pratama, Ramadhani Noor, Najwaini, Effan, Rozaq, Abdul
Source: AIP Conference Proceedings; 2023, Vol. 2693 Issue 1, p1-15, 15p
Subject terms: PRIME numbers, INFORMATION storage & retrieval systems, RESEARCH & development, ALGORITHMS, DATABASES
Abstract: This research builds a system that checks similarity using the winnowing algorithm. The algorithm detects similarity between research titles, which helps prevent duplicate research and find references from similar studies. The user checks a title by entering it in the form provided; the system then compares it against all titles stored in the database. The winnowing algorithm requires three parameter values: N-Gram, Window, and Prime Number. These three parameters affect the similarity results. This study focuses on title similarity, and a title has far fewer words than the full content of an article. Because the texts being compared are so short, choosing the three winnowing parameter values is very important. The study ran a trial to find the optimal values. It concluded that, for the same window, the higher the N-Gram value, the smaller the similarity percentage. For the same N-Gram, a higher window value does not change the similarity percentage significantly, but the trend is decreasing. Based on the test results, the optimal prime number is 23, the optimal N-Gram is 5, and the optimal window is 2. [ABSTRACT FROM AUTHOR]
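The abstract names the three winnowing parameters but does not show how they interact. The following Python code is a minimal sketch, not the authors' implementation. It assumes that the prime number serves as the base of a polynomial hash over each character N-Gram, that the smallest hash in each window is kept as a fingerprint, and that similarity is the Jaccard coefficient of the two fingerprint sets expressed as a percentage. The function names (`normalize`, `ngrams`, `gram_hash`, `fingerprint`, `similarity`) are hypothetical; the default values are the optimal ones reported in the abstract.

```python
from typing import List, Set


def normalize(text: str) -> str:
    """Lowercase the text and keep only letters and digits."""
    return "".join(ch for ch in text.lower() if ch.isalnum())


def ngrams(text: str, n: int) -> List[str]:
    """Return every contiguous character sequence of length n."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def gram_hash(gram: str, prime: int) -> int:
    """Polynomial hash of one n-gram (assumption: the prime is the base)."""
    h = 0
    for ch in gram:
        h = h * prime + ord(ch)
    return h


def fingerprint(text: str, n: int = 5, window: int = 2, prime: int = 23) -> Set[int]:
    """Keep the minimum hash of every sliding window of `window` hashes."""
    hashes = [gram_hash(g, prime) for g in ngrams(normalize(text), n)]
    if not hashes:
        return set()
    if len(hashes) < window:
        # Text too short to fill a single window: keep its smallest hash.
        return {min(hashes)}
    return {min(hashes[i:i + window]) for i in range(len(hashes) - window + 1)}


def similarity(a: str, b: str, n: int = 5, window: int = 2, prime: int = 23) -> float:
    """Jaccard similarity of the two fingerprint sets, as a percentage (assumption)."""
    fa = fingerprint(a, n, window, prime)
    fb = fingerprint(b, n, window, prime)
    union = fa | fb
    if not union:
        return 0.0
    return 100.0 * len(fa & fb) / len(union)
```

In a system like the one described, `similarity` would be called once for each title stored in the database and the results sorted in descending order. A larger `n` makes each N-Gram more specific, so fewer hashes are shared between two short titles, which is consistent with the decreasing similarity the abstract reports.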
Database: Complementary Index
Description
ISSN: 0094-243X
DOI: 10.1063/5.0118709