تقرير
DWReCO at CheckThat! 2023: Enhancing Subjectivity Detection through Style-based Data Sampling
العنوان: | DWReCO at CheckThat! 2023: Enhancing Subjectivity Detection through Style-based Data Sampling |
---|---|
المؤلفون: | Schlicht, Ipek Baris, Khellaf, Lynn, Altiok, Defne |
سنة النشر: | 2023 |
المجموعة: | Computer Science |
مصطلحات موضوعية: | Computer Science - Computation and Language, Computer Science - Computers and Society, Computer Science - Machine Learning |
الوصف: | This paper describes our submission for the subjectivity detection task at the CheckThat! Lab. To tackle class imbalances in the task, we have generated additional training materials with GPT-3 models using prompts of different styles from a subjectivity checklist based on journalistic perspective. We used the extended training set to fine-tune language-specific transformer models. Our experiments in English, German and Turkish demonstrate that different subjective styles are effective across all languages. In addition, we observe that the style-based oversampling is better than paraphrasing in Turkish and English. Lastly, the GPT-3 models sometimes produce lacklustre results when generating style-based texts in non-English languages. Comment: Accepted to CLEF CheckThat! Lab |
نوع الوثيقة: | Working Paper |
URL الوصول: | http://arxiv.org/abs/2307.03550 |
رقم الأكسشن: | edsarx.2307.03550 |
قاعدة البيانات: | arXiv |
الوصف غير متاح. |