تقرير
Evaluation and Continual Improvement for an Enterprise AI Assistant
العنوان: | Evaluation and Continual Improvement for an Enterprise AI Assistant |
---|---|
المؤلفون: | Maharaj, Akash V., Qian, Kun, Bhattacharya, Uttaran, Fang, Sally, Galatanu, Horia, Garg, Manas, Hanessian, Rachel, Kapoor, Nishant, Russell, Ken, Vaithyanathan, Shivakumar, Li, Yunyao |
سنة النشر: | 2024 |
المجموعة: | Computer Science |
مصطلحات موضوعية: | Computer Science - Human-Computer Interaction |
الوصف: | The development of conversational AI assistants is an iterative process with multiple components. As such, the evaluation and continual improvement of these assistants is a complex and multifaceted problem. This paper introduces the challenges in evaluating and improving a generative AI assistant for enterprises, which is under active development, and how we address these challenges. We also share preliminary results and discuss lessons learned. Comment: Accepted to DaSH Workshop at NAACL 2024 |
نوع الوثيقة: | Working Paper |
URL الوصول: | http://arxiv.org/abs/2407.12003 |
رقم الأكسشن: | edsarx.2407.12003 |
قاعدة البيانات: | arXiv |
الوصف غير متاح. |