Online Optimization and Learning in Uncertain Dynamical Environments with Performance Guarantees

التفاصيل البيبلوغرافية
العنوان: Online Optimization and Learning in Uncertain Dynamical Environments with Performance Guarantees
المؤلفون: Li, Dan, Fooladivanda, Dariush, Martinez, Sonia
بيانات النشر: arXiv, 2021.
سنة النشر: 2021
مصطلحات موضوعية: FOS: Computer and information sciences, Computer Science - Machine Learning, Machine Learning (stat.ML), Systems and Control (eess.SY), Dynamical Systems (math.DS), Electrical Engineering and Systems Science - Systems and Control, Statistics - Applications, Machine Learning (cs.LG), Statistics - Machine Learning, Optimization and Control (math.OC), FOS: Electrical engineering, electronic engineering, information engineering, FOS: Mathematics, Applications (stat.AP), Mathematics - Dynamical Systems, Mathematics - Optimization and Control
الوصف: We propose a new framework to solve online optimization and learning problems in unknown and uncertain dynamical environments. This framework enables us to simultaneously learn the uncertain dynamical environment while making online decisions in a quantifiably robust manner. The main technical approach relies on the theory of distributional robust optimization that leverages adaptive probabilistic ambiguity sets. However, as defined, the ambiguity set usually leads to online intractable problems, and the first part of our work is directed to find reformulations in the form of online convex problems for two sub-classes of objective functions. To solve the resulting problems in the proposed framework, we further introduce an online version of the Nesterov accelerated-gradient algorithm. We determine how the proposed solution system achieves a probabilistic regret bound under certain conditions. Two applications illustrate the applicability of the proposed framework.
DOI: 10.48550/arxiv.2102.09111
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::b41984237c2fa522b3d1c5286622a3af
حقوق: OPEN
رقم الأكسشن: edsair.doi.dedup.....b41984237c2fa522b3d1c5286622a3af
قاعدة البيانات: OpenAIRE
الوصف
DOI:10.48550/arxiv.2102.09111