Empirical analysis of variance for genetic programming based symbolic regression

التفاصيل البيبلوغرافية
العنوان: Empirical analysis of variance for genetic programming based symbolic regression
المؤلفون: Gabriel Kronberger, Stephan M. Winkler, Lukas Kammerer
المصدر: GECCO Companion
بيانات النشر: ACM, 2021.
سنة النشر: 2021
مصطلحات موضوعية: Polynomial, Statistics, Benchmark (computing), Genetic programming, Sensitivity (control systems), Variance (accounting), Bias–variance tradeoff, Symbolic regression, Mathematics, Random forest
الوصف: Genetic programming (GP) based symbolic regression is a stochastic, high-variance algorithm. Its sensitivity to changes in training data is a drawback for practical applications. In this work, we analyze empirically the variance of GP models on the PennML benchmarks. We measure the spread of model predictions when models are trained on slightly perturbed data. We compare the spread of models from two GP variants as well as linear, polynomial and random forest regression models. The results show that the spread of models from GP with local optimization is significantly higher than that of all other algorithms. As a side effect of our analysis, we provide evidence that the PennML benchmark contains two groups of instances (Friedman and real-world problem instances) for which GP performs significantly different.
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_________::9d5a4870a0dd5174f091acfa1ffc9683
https://doi.org/10.1145/3449726.3459486
رقم الأكسشن: edsair.doi...........9d5a4870a0dd5174f091acfa1ffc9683
قاعدة البيانات: OpenAIRE