An investigation on the use of Large Language Models for hyperparameter tuning in Evolutionary Algorithms

التفاصيل البيبلوغرافية
العنوان: An investigation on the use of Large Language Models for hyperparameter tuning in Evolutionary Algorithms
المؤلفون: Custode, Leonardo Lucio, Caraffini, Fabio, Yaman, Anil, Iacca, Giovanni
سنة النشر: 2024
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Neural and Evolutionary Computing
الوصف: Hyperparameter optimization is a crucial problem in Evolutionary Computation. In fact, the values of the hyperparameters directly impact the trajectory taken by the optimization process, and their choice requires extensive reasoning by human operators. Although a variety of self-adaptive Evolutionary Algorithms have been proposed in the literature, no definitive solution has been found. In this work, we perform a preliminary investigation to automate the reasoning process that leads to the choice of hyperparameter values. We employ two open-source Large Language Models (LLMs), namely Llama2-70b and Mixtral, to analyze the optimization logs online and provide novel real-time hyperparameter recommendations. We study our approach in the context of step-size adaptation for (1+1)-ES. The results suggest that LLMs can be an effective method for optimizing hyperparameters in Evolution Strategies, encouraging further research in this direction.
Comment: Proceedings of the GECCO'24 Companion: Genetic and Evolutionary Computation Conference Companion
نوع الوثيقة: Working Paper
DOI: 10.1145/3638530.3664163
URL الوصول: http://arxiv.org/abs/2408.02451
رقم الأكسشن: edsarx.2408.02451
قاعدة البيانات: arXiv