Shape-constrained Symbolic Regression -- Improving Extrapolation with Prior Knowledge

التفاصيل البيبلوغرافية
العنوان: Shape-constrained Symbolic Regression -- Improving Extrapolation with Prior Knowledge
المؤلفون: F. O. de Franca, Michael Kommenda, Gabriel Kronberger, Christian Haider, Bogdan Burlacu
سنة النشر: 2021
مصطلحات موضوعية: FOS: Computer and information sciences, Polynomial regression, Computer science, Evolutionary algorithm, Computer Science - Neural and Evolutionary Computing, Machine Learning (stat.ML), Genetic programming, Function (mathematics), Biological Evolution, Interval arithmetic, Set (abstract data type), Computational Mathematics, Statistics - Machine Learning, Test set, Neural and Evolutionary Computing (cs.NE), Symbolic regression, Algorithm, Algorithms
الوصف: We investigate the addition of constraints on the function image and its derivatives for the incorporation of prior knowledge in symbolic regression. The approach is called shape-constrained symbolic regression and allows us to enforce, for example, monotonicity of the function over selected inputs. The aim is to find models which conform to expected behavior and which have improved extrapolation capabilities. We demonstrate the feasibility of the idea and propose and compare two evolutionary algorithms for shape-constrained symbolic regression: (i) an extension of tree-based genetic programming which discards infeasible solutions in the selection step, and (ii) a two-population evolutionary algorithm that separates the feasible from the infeasible solutions. In both algorithms we use interval arithmetic to approximate bounds for models and their partial derivatives. The algorithms are tested on a set of 19 synthetic and four real-world regression problems. Both algorithms are able to identify models which conform to shape constraints which is not the case for the unmodified symbolic regression algorithms. However, the predictive accuracy of models with constraints is worse on the training set and the test set. Shape-constrained polynomial regression produces the best results for the test set but also significantly larger models.
اللغة: English
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::36db28c0f5c48e925505e4c21af1bd3d
http://arxiv.org/abs/2103.15624
حقوق: OPEN
رقم الأكسشن: edsair.doi.dedup.....36db28c0f5c48e925505e4c21af1bd3d
قاعدة البيانات: OpenAIRE