Prompt Design Matters for Computational Social Science Tasks but in Unpredictable Ways

التفاصيل البيبلوغرافية
العنوان:	Prompt Design Matters for Computational Social Science Tasks but in Unpredictable Ways
المؤلفون:	Atreja, Shubham, Ashkinaze, Joshua, Li, Lingyao, Mendelsohn, Julia, Hemphill, Libby
سنة النشر:	2024
المجموعة:	Computer Science
مصطلحات موضوعية:	Computer Science - Artificial Intelligence, Computer Science - Computers and Society
الوصف:	Manually annotating data for computational social science tasks can be costly, time-consuming, and emotionally draining. While recent work suggests that LLMs can perform such annotation tasks in zero-shot settings, little is known about how prompt design impacts LLMs' compliance and accuracy. We conduct a large-scale multi-prompt experiment to test how model selection (ChatGPT, PaLM2, and Falcon7b) and prompt design features (definition inclusion, output type, explanation, and prompt length) impact the compliance and accuracy of LLM-generated annotations on four CSS tasks (toxicity, sentiment, rumor stance, and news frames). Our results show that LLM compliance and accuracy are highly prompt-dependent. For instance, prompting for numerical scores instead of labels reduces all LLMs' compliance and accuracy. The overall best prompting setup is task-dependent, and minor prompt changes can cause large changes in the distribution of generated labels. By showing that prompt design significantly impacts the quality and distribution of LLM-generated annotations, this work serves as both a warning and practical guide for researchers and practitioners. Comment: under review
نوع الوثيقة:	Working Paper
URL الوصول:	http://arxiv.org/abs/2406.11980
رقم الأكسشن:	edsarx.2406.11980
قاعدة البيانات:	arXiv

الوصف
الوصف غير متاح.