HC3 Plus: A Semantic-Invariant Human ChatGPT Comparison Corpus

التفاصيل البيبلوغرافية
العنوان: HC3 Plus: A Semantic-Invariant Human ChatGPT Comparison Corpus
المؤلفون: Su, Zhenpeng, Wu, Xing, Zhou, Wei, Ma, Guangyuan, Hu, Songlin
سنة النشر: 2023
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
الوصف: ChatGPT has gained significant interest due to its impressive performance, but people are increasingly concerned about its potential risks, particularly around the detection of AI-generated content (AIGC), which is often difficult for untrained humans to identify. Current datasets utilized for detecting ChatGPT-generated text primarily center around question-answering, yet they tend to disregard tasks that possess semantic-invariant properties, such as summarization, translation, and paraphrasing. Our primary studies demonstrate that detecting model-generated text on semantic-invariant tasks is more difficult. To fill this gap, we introduce a more extensive and comprehensive dataset that considers more types of tasks than previous work, including semantic-invariant tasks. In addition, the model after a large number of task instruction fine-tuning shows a strong powerful performance. Owing to its previous success, we further instruct fine-tuning T\textit{k}-instruct and build a more powerful detection system.
Comment: This paper has been accepted by CIKM2023 workshop
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2309.02731
رقم الأكسشن: edsarx.2309.02731
قاعدة البيانات: arXiv