Evaluation and Continual Improvement for an Enterprise AI Assistant

Bibliographic Details
Title: Evaluation and Continual Improvement for an Enterprise AI Assistant
Authors: Maharaj, Akash V.; Qian, Kun; Bhattacharya, Uttaran; Fang, Sally; Galatanu, Horia; Garg, Manas; Hanessian, Rachel; Kapoor, Nishant; Russell, Ken; Vaithyanathan, Shivakumar; Li, Yunyao
Publication Year: 2024
Collection: Computer Science
Subject Terms: Computer Science - Human-Computer Interaction
Description: The development of conversational AI assistants is an iterative process with multiple components. As such, the evaluation and continual improvement of these assistants is a complex and multifaceted problem. This paper introduces the challenges in evaluating and improving a generative AI assistant for enterprises, which is under active development, and how we address these challenges. We also share preliminary results and discuss lessons learned.
Comment: Accepted to DaSH Workshop at NAACL 2024
Document Type: Working Paper
Access URL: http://arxiv.org/abs/2407.12003
Accession Number: edsarx.2407.12003
Database: arXiv