SELF-[IN]CORRECT: LLMs Struggle with Refining Self-Generated Responses

التفاصيل البيبلوغرافية
العنوان: SELF-[IN]CORRECT: LLMs Struggle with Refining Self-Generated Responses
المؤلفون: Jiang, Dongwei, Zhang, Jingyu, Weller, Orion, Weir, Nathaniel, Van Durme, Benjamin, Khashabi, Daniel
سنة النشر: 2024
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning
الوصف: Can LLMs continually improve their previous outputs for better results? An affirmative answer would require LLMs to be better at discriminating among previously-generated alternatives, than generating initial responses. We explore the validity of this hypothesis in practice. We first introduce a unified framework that allows us to compare the generative and discriminative capability of any model on any task. Then, in our resulting experimental analysis of several LLMs, we do not observe the performance of those models on discrimination to be reliably better than generation. We hope these findings inform the growing literature on self-improvement AI systems.
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2404.04298
رقم الأكسشن: edsarx.2404.04298
قاعدة البيانات: arXiv