Probability Density Estimation through Nonparametric Adaptive Partitioning and Stitching

التفاصيل البيبلوغرافية
العنوان: Probability Density Estimation through Nonparametric Adaptive Partitioning and Stitching
المؤلفون: Jacobs, Zach D. Merino, Jenny Farmer, Donald J.
المصدر: Algorithms; Volume 16; Issue 7; Pages: 310
بيانات النشر: Multidisciplinary Digital Publishing Institute, 2023.
سنة النشر: 2023
مصطلحات موضوعية: nonparametric density estimation, adaptive data partitioning, branching tree, load balancing, heavy tails, divergent distributions, multi-modal distributions, adaptive kernel density, maximum entropy
الوصف: We present a novel nonparametric adaptive partitioning and stitching (NAPS) algorithm to estimate a probability density function (PDF) of a single variable. Sampled data is partitioned into blocks using a branching tree algorithm that minimizes deviations from a uniform density within blocks of various sample sizes arranged in a staggered format. The block sizes are constructed to balance the load in parallel computing as the PDF for each block is independently estimated using the nonparametric maximum entropy method (NMEM) previously developed for automated high throughput analysis. Once all block PDFs are calculated, they are stitched together to provide a smooth estimate throughout the sample range. Each stitch is an averaging process over weight factors based on the estimated cumulative distribution function (CDF) and a complementary CDF that characterize how data from flanking blocks overlap. Benchmarks on synthetic data show that our PDF estimates are fast and accurate for sample sizes ranging from 29 to 227, across a diverse set of distributions that account for single and multi-modal distributions with heavy tails or singularities. We also generate estimates by replacing NMEM with kernel density estimation (KDE) within blocks. Our results indicate that NAPS(NMEM) is the best-performing method overall, while NAPS(KDE) improves estimates near boundaries compared to standard KDE.
وصف الملف: application/pdf
اللغة: English
تدمد: 1999-4893
DOI: 10.3390/a16070310
URL الوصول: https://explore.openaire.eu/search/publication?articleId=multidiscipl::b94c63220de325a426d2bb5e2becdf83
حقوق: OPEN
رقم الأكسشن: edsair.multidiscipl..b94c63220de325a426d2bb5e2becdf83
قاعدة البيانات: OpenAIRE
الوصف
تدمد:19994893
DOI:10.3390/a16070310