A survey and benchmark of high-dimensional Bayesian optimization of discrete sequences

التفاصيل البيبلوغرافية
العنوان: A survey and benchmark of high-dimensional Bayesian optimization of discrete sequences
المؤلفون: González-Duque, Miguel, Michael, Richard, Bartels, Simon, Zainchkovskyy, Yevgen, Hauberg, Søren, Boomsma, Wouter
سنة النشر: 2024
المجموعة: Computer Science
Statistics
مصطلحات موضوعية: Computer Science - Machine Learning, Statistics - Machine Learning
الوصف: Optimizing discrete black-box functions is key in several domains, e.g. protein engineering and drug design. Due to the lack of gradient information and the need for sample efficiency, Bayesian optimization is an ideal candidate for these tasks. Several methods for high-dimensional continuous and categorical Bayesian optimization have been proposed recently. However, our survey of the field reveals highly heterogeneous experimental set-ups across methods and technical barriers for the replicability and application of published algorithms to real-world tasks. To address these issues, we develop a unified framework to test a vast array of high-dimensional Bayesian optimization methods and a collection of standardized black-box functions representing real-world application domains in chemistry and biology. These two components of the benchmark are each supported by flexible, scalable, and easily extendable software libraries (poli and poli-baselines), allowing practitioners to readily incorporate new optimization objectives or discrete optimizers. Project website: https://machinelearninglifescience.github.io/hdbo_benchmark
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2406.04739
رقم الأكسشن: edsarx.2406.04739
قاعدة البيانات: arXiv