دورية أكاديمية

Efficient estimation of the cardinality of large data sets

التفاصيل البيبلوغرافية
العنوان: Efficient estimation of the cardinality of large data sets
المؤلفون: Philippe Chassaing, Lucas Gerin
المصدر: Discrete Mathematics & Theoretical Computer Science, Vol DMTCS Proceedings vol. AG,..., Iss Proceedings (2006)
بيانات النشر: Discrete Mathematics & Theoretical Computer Science, 2006.
سنة النشر: 2006
المجموعة: LCC:Mathematics
مصطلحات موضوعية: cardinality, large multiset, approximate counting, [info.info-ds] computer science [cs]/data structures and algorithms [cs.ds], [info.info-dm] computer science [cs]/discrete mathematics [cs.dm], [math.math-co] mathematics [math]/combinatorics [math.co], Mathematics, QA1-939
الوصف: Giroire has recently proposed an algorithm which returns the $\textit{approximate}$ number of distinct elements in a large sequence of words, under strong constraints coming from the analysis of large data bases. His estimation is based on statistical properties of uniform random variables in $[0,1]$. In this note we propose an optimal estimation, using Kullback information and estimation theory.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 1365-8050
Relation: https://dmtcs.episciences.org/3492/pdf; https://doaj.org/toc/1365-8050
DOI: 10.46298/dmtcs.3492
URL الوصول: https://doaj.org/article/61f81c04330c40319deca61f58fe4e3e
رقم الأكسشن: edsdoj.61f81c04330c40319deca61f58fe4e3e
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:13658050
DOI:10.46298/dmtcs.3492